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In this Issue 



As promised in this space in the June issue, this issue presents the hardware 
design of the HP Apollo 9000 Series 700 workstations. For readers who didn't see 
the June issue, the Series 700 is a group of computers that surprised the work- 
station world early last year with a huge increase in speed over what had been 
available, coupled with very competitive prices. Three factors combined to give 
the Series 700 machines their speed advantage. One is a new version of HP's 
PA-RISC architecture that includes enhancements specifically aimed at improv- 
ing workstation performance (the PA stands for precision architecture and the 
RISC stands for reduced instruction set computing). The second factor in the 
new computers' speed is a set of VLSI (very large-scale integration) integrated 
circuit chips capable of operating at clock rates up to 66 MHz. Called PCX-S, the chipset includes a 
577,000-transistor CPU, a 640,000-transistor floating-point coprocessor, and a 185,000-transistor memory 
and system bus controller. The third Series 700 speed enhancer (which was covered in the June issue) is 
a new version of the HP-UX operating system that takes advantage of the architectural enhancements 
and includes additional compiler optimizations to make programs run faster. 

In the article on page 6, two Series 700 project managers give us an overview of the system design and 
performance of the HP 9000 Models 720, 730, and 750, the three original members of this product family. 
The PCX-S chipset is the subject of the article on page 12. The article on page 23 deals with the design of 
the all-important clock system that keeps everything working together, and the article on page 26 de- 
scribes the built-in input/output subsystem, which provides several different standard interfaces, an 
audio system, and a real-time clock. Verification of the chip and system designs by computer simulation 
as described in the article on page 34 resulted in first silicon chips that were able to boot the operating 
system with no workarounds, a new record for HP. The package design of the original Series 700 work- 
stations is covered in the article on page 43, and the manufacturing engineering, which included setting 
up a new surface mount process for printed circuit board production, is discussed in the article on page 
49. There are now two entry level members of this product family in addition to the three original models. 
The system design and performance of the HP 9000 Models 705 and 710 workstations are described in 
the article on page 55. 

The HP DeskJet and DeskWriter printers are low-cost thermal inkjet printers that deliver 300-dot-per- 
inch laser-quality printing on plain paper. While the original models printed only black, a color revolution 
has been taking place in the printer marketplace, creating a demand for an affordable, plain-paper color 
printer. Engineers at HP's InkJet Components Division were given the job of developing a color print car- 
tridge for the DeskJet and DeskWriter printers, and their colleagues at the Vancouver Division were 
charged with adapting the printer platform to use the new cartridge. The color cartridge had to be inter- 
changeable with the existing black print cartridge but contain three ink reservoirs instead of just one. 
Also, because the printers accept only one cartridge, the color cartridge couldn't depend on having a 
black cartridge around to help out, and had to be able to prim a good black using only cyan, magenta, 
and yellow inks. As summarized in the article on page 64, these challenges were met, resulting in the 
new HP DeskWriter C and DeskJet 500C printers. The details of the design of the color cartridge and the 
inks are presented in the article on page 69. Manufacturing thermal inkjet print cartridges in high volume 
poses some formidable problems. The assembly process for the color cartridge is outlined in the article 
on page 77 (except for a few proprietary details). Many of the assembly steps depend on machine vision, 
as described in the article on page 87. Adhesives and encapsulants and the equipment to dispense them 
in tiny volumes to meet exacting requirements for the color print cartridge are the subjects of the article 
on page 84. To adapt the printer platforms to use the color cartridge, maior changes were made in the 
software drivers. The driver for the DeskWriter C printer for Apple Macintosh computers is described in 
the article on page 93. 
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As the designers explain it in the article on page 103, "In a typical manufacturing environment, planners 
and buyers receive MRP (materials requirements planning) reports after running a regularly scheduled 
MRP program. These reports, called action reports, show suggested actions for work orders and pur- 
chase orders to meet future demand for inventory. Planners and buyers typically have to wade through 
mountains of paper reports to determine the suggestions they would like to act on. Then, they make any 
changes on paper and manually enter these changes into an MRP package. HP MRP Action Manager 
software automates this process." The HP MRP Action Manager software design is based on the client;' 
server model The planner or buyer uses a client machine (a personal computer running the HP New- 
Wave environment) which communicates over a network with the server (an HP 3000 computer running 
the MPE/iX operating system). 

R.P Dolan 
Editor 



Cover 

The PCX-S chipset for the HP Apollo 9000 Series 700 workstations includes a CPU, a floating-point co- 
processor, and a memory and system bus controller The chips implement the PA-RISC 1.1 architecture 
and run at 66 MHz. 



What's Ahead 

Featured in the October issue will be: 

• The HP 71500A microwave transition analyzer, a new instrument architecture for pulsed RF measurements 

• The HP 4980 Series Network Advisor protocol analyzers, an expert-system-based family of instruments 
for digital network troubleshooting 

• HP Visual Engineering Environment, a software package that allows engineers to design test programs 
by drawing familiar block diagrams 

• The HP E3560 digital performance monitoring and remote test system, a system for surveillance of the 
quality of European digital telecommunications networks 

• The HP HDMP-1000 gigabit-link chipset, two easy-to-use chips that implement a 1 4-gigabaud serial 
digital data link. 
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Midrange PA-RISC Workstations with 
Price/Performance Leadership 

The HP 9000 Models 720, 730, and 750 workstations achieve exceptional 
performance ratings on industry-standard benchmarks through a 
combination of a high CPU clock rate (up to 66 MHz) and tuning of the 
subsystem, compiler, and operating system designs. This article presents 
an overview of the hardware design. 

by Andrew J. DeBaets and Kathleen M. Wheeler 



Three IIP ilOOO Scries 700 workstation computers Were 
introduced in March 1991. These products were milestones 
in three respects, first, they were the Bret IIP PA-RISC com- 
puters designed specifically as workstations (PA-RISC is 
HP's reduced instruction set computer architecture). Second, 
the product design cycle from Inception to system release 
was executed in only 18 months. Third, they surprised Ihe 
workstation world with their unprecedented combination of 
price and performance, 

Fig. I shows the three workstation models. All are consid- 
ered midrange computers, higher in performance than the 
low-end or entry-level models described in Ihe article on 
page 55, which were inlroduced later. 

The Model 720 is a 50-MHz desktop workstation. It is 
designed as a general-purpose workstation for CAD/CAE, 



scientific, and CASK applications.* It provides in a single 
enclosure a cached uniprocessor, onboard I/O. up to two 
internal SCSI 3. 5-inch hard disks. ;ui optional flexible disk 
drive, up to I28M bytes of RAM, an optional EISA (Extended 
Industry-Standard Architecture) interface, several graphics 
options, and a power supply. 

The Model 7:i0 is a (iti-MIlz general-purpose desktop work- 
station. Willi ils additional performance, il makes a very 
competitive .'ill graphics application engine. It is configured 
identically to Ihe Model 720, except that the EISA slot is 
included in the base unit. The enclosures for Ihe Models 720 
and 730 are identical and are the result of new industrial 
design specifically for these products. 

"CAD. CAE, and CASE stand lot computer-aided design. engineering, and software 
engineering, lespectively. 
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The Model "ii j< a liii-MHz deskside workstation and file 
serv er designed as a high-performance 3D graphics work- 
station or as a workstation file server. Its enclosure is a ver- 
tical tower on a pedestal. The product design was leveraged 
from the HP 9000 Series 400 workstations. On a single 
motherboard the Model 750 provides all the functionality of 
the Model 730, including a cached uniprocessor, built-in I/O. 
and EISA interface The memory cabability is increased to a 
maximum of 3&4M bytes, hi addition, the Model 750 provides 
a total of four EISA slots, two graphics options, and up to 
four mass storage devices. 

The design and implementation of these three workstations 
were undertaken with the following objectives, in order of 
priority: 

Time to market (schedule) 

Performance 

Product cost 
Development cost. 

Quality was an overriding assumption of the project. The 
following sections provide an overview of the resulting 
system design, and accompanying articles in this issue give 
further technical details of the system components. 

System Overview 

The desktop Model 720 and 730 workstations are packaged 
in a new mechanical design. One of the key features of this 
package is the ease with which modules can be removed 
and replaced. This simplifies manufacturing and field sup- 
port. The replaceable components include the CPU board, 
the graphics board, the system I/O board, the EISA expan- 
sion bus anil EISA-compatible I/O cards, the disk drives, and 
the power supply. This makes it possible to define, design, 
and deliver component upgrade products quickly. These 
products can be offered its simple "board swap" upgrades to 
the installed base. 

The deskside Model 750 workstation is packaged in a slightly 
modified HP Apollo 0000 Series 400 mechanical design. This 
existing package design offered sufficient volume, power, 
and cooling to contain all of the functionality of the Model 
730 with increased expandability. Additional graphics, 
DRAM, mass storage, and EISA cards are supported within 
this larger package. The design of the Model 750 also allows 
motherboard "board swap" upgrades, 

The industrial and product designs of the Model 720/730 and 
Model 750 packages are described in the article on page 43. 

The Model 720 provides the following features: 
50-MIIz PA-RISC CPU 

50-MHz PA-RISC 01-bit floating-point coprocessor 

12KK-byte instruction cache 

256K-byte copyhack data cache. 04 bits wide 

8M to 12SM bytes of error correct ing memory 

5-Mbyle/s SCSI II interface 

10 Mbit/s IEEE 802.3 LAN interface 

Two RS-232 modem-control serial potts 

35(l-kbyte/s bidirectional parallel port 

Audio lone generator 



• IIP-IIIL device interface for keyboard, mouse, and other 
input devices 

• One high-performance graphics slot, with a variety of 
graphics options 

• One optional industry -standard EISA slot, w ith a variety of 
I/O cards 

• Two integrated 3.5-inch 420-Mbyte fixed disk drives 

• One optional 3.5-uich PC-compatible flexible ilisk drive i m 
place of one fixed drive). 

The Model 730 increases performance and functionality over 
the Model 720 by adding the following features: 

• 06-MHz PA-RISC CPU 

• 66-MHz PA-RISC 64-bit floating-point coprocessor 

• One standard EISA slot. 

The Model 750 increases performance and functionality over 
the Model 720 by offering the following additional features: 

• 06-MHz PA-RISC CPU 

• 66-MHz PA-RISC 64-bit floating-point coprocessor 

• 256K-byte instruction cache 

• 16M to 384 M bytes of error correcting memory 

• Two high-performance graphics slots 

• Four standard EISA slots 

• Up to four mass storage devices. Two full-height 5.25-inch 
devices can be supported with two additional half bays that 
support either 3.5-inch devices or half-height 5.25-inch 
devices. Up to three of these bays can support removable 
media, and t he two half bays can also be converted to 
support a third fiill-heighl 5.25-inch mechanism. 

A configuration with entry-level graphics and EISA consists 
of four major printed circuit board assemblies in the Models 
720 and 730 and two major printed circuit board assemblies 
in the Model 750. These boards contain six fully custom 
VLSI chips and three semicustom VLSI chips developed to 
implement the features listed above. 

To meet the aggressive Schedule objectives, the quality of 
the nine core chips needed to be very high. Management set 
a very clear expectation that the chips would be "right the 
first lime" and provided resources to verify the designs thor- 
oughly before tape release. As a result of the hard work of 
the design and verification teams, there were a number of 
significant successes. The first CPU chip booted the HP-UX* 
operating system four minutes after it was installed in a pro- 
cessor board. All of the hardware was released for volume 
production only three months after the final VLSI chip 
arrived — and this VLSI chip was the heart of the system. A 
number of chips were released to production without 
changes, These successes allowed the aggressive schedule 
to be met. Additional details regarding the chips' design anil 
verification can be found in the articles on pages 12 and 34. 
respectively. 

In addition In the hardware, a common set of firmware was 
Implemented on the three products. This firmware provides 
a number Of functions, including self-test, booting, and con- 
sole handling. A description of litis firmware is provided on 
page o. 
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Processor and Memory Subsystem 

Fig. 2 is a block diagram of I lie processor and memory sub- 
system. The processor and memory designs fur the Models 
720. 730, and 750 are identical excepi for frequency. The 
Models 720 and 730 implement the functionality on a single 
printed circuit board, the Model 750 implements the same 
design, with I/O and EISA interface, on a motherboard twice 
the size of the Model 720/730 processor board 

Thi' Model 720 CP!" clock frequency is 50 MHz and the 
Model 730/750 1 "PI 1 clock frequency is 66 MHz. Therefore, 
one instruction is executed every 20 ns and 15 ns. respec- 
tively. The clock system for the entire system is implemented 
with ECL logic and a crystal oscillator running at twice the 
('PL' frequency. Very careful attention was paid in the design 
of the printed circuit boards to ensure that the clocks are 
synchronous at the key components of the system. Delays 
were calculated and adjusted in the clock trace lengths. The 
article on page 23 describes the design details of the clock 
system. 

The uniprocessor design uses a VLSI CPU chip whose archi- 
tecture is referred to as PCX-S (see article, page 12). The 
PCX-S chip is fabricated in I IPs CMOS26 process and is 



housed in a 408-pin Ceramic PGA (pin-grid array). The pro- 
cessor architecture is 32 bits wide. Key features include 
cache com rollers onboard, separate TLBs (translation look- 
aside buffers) for data and instruction memory management, 
a five-stage pipeline, and test and diagnostics support. 

This is the first PA-RISC processor design optimized for 
workstation requirements. The performance modeling 
looked at key requirements for the workstation market- 
place. Both the floating-point coprocessor and the caches 
are optimized for workstation applications. The PCX-S CPU 
and floating-point coprocessor implement the PA-RISC 1.1 
architecture, which includes enhancements to the original 
PA-RISC architecture resulting from a study of workstation 
requirements by IIP and Apollo engineers. Therefore, this 
design represents many of the best practices of both IIP 
and Apollo, now both represented by the IIP Workstation 
Business Unit. 

The cache is implemented as a single-level, off-chip cache 
memory (as opposed to many industry workstations, which 
implement a primary cache on the processor chip and a sec- 
ondary cache off -chip). It is configured as 250K bytes of data 
cache and 12SK bytes of instruction cache for the Models 



128K-Byto 
Instruction 
Cache 




Note: Shaded blocks indicate functionality implemented on the Model 750 
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Fig. 2. Series 70(1 workstation proeessor and mcmor) subsystem (Stock diagram. 
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720 and 730. Hie Model 750 is configured with 256K bytes of 
both instruction cache and data cache The cache is directly 
mapped and has parity for error deledion. The instruction 
cache is 32 bits wide and is organized as lines of four double 
words (32 bytes). The data cache is 64 bits wide and is also 
organized as lines of four double words (32 bytes). Parity is 
generated for each 32-bit word in both caches. Associated 
with each line in the caches is a tag. which provides 20 bits of 
addressing information and an additional two bits of status 
32K x 8-bit and 8K x S-bit SRAMs are used in the Models 720 
and 730. while the Model 7:>0 uses 64K x 4-bit and HK X S-hit 
SRAMS, The SRAMs used to implement the Instruction and 
data caches have an access time of 15 ns for the Model ?20 
and 12 ns for the Models 730 and 750. The fast access rates 
are required so that instructions and data can be input on 
every clock cycle. 

The floating-point coprocessor was a joint design effort of 
HP and Texas Instruments, Inc. It has a 04-bit internal archi- 
tecture and has direct access to the instruction and data 
caches. It is housed in a 204-pin ceramic PGA. This chip is 
also the result of careful implementation of previously unim- 
plemented features to optimize workstation performance. 
These enhancements, described in PA-RISC 1.1 specifications, 
expand the floating-point instruction set. 

The processor communicates with the memory and I/O sub- 
systems \ia the PBus (processor bus). The 32-bil-wide PBus 
multiplexes the address and data lines. The transfer of data 
to memory and the rest of the system is controlled by a 
single-chip memory and system bus controller, fabricated in 
CMOS26 technology and housed in a 272-pin ceramic PGA. 
To optimize the efficiency of memory transfers, the width of 
the memory array was set at 144 bits ( 12S data bits and 16 
bits of error correction code). This scheme was most effec- 
tively implemented with available hardware by creating two 
banks of user-installable SIMM (single inline memory module) 
cards.* Bach card is 72 bits wide. The current SIMM imple- 
mentation uses IM-bit DRAMs (SM bytes/SIMM ) and IM-bit 
DRAMs ( HiM bytes/SIMM). The theoretical maximum mem- 
ory allowed in the system using Ki.M-bit DRAM technology is 
1(1 bytes. The systems implement a fixed number of slots for 
SIMMs (S slots in the Models 720 and 730 and 12 slots in the 
Model 7511), so the actual maximum memory is less than HI 
bytes. Each SIMM uses a proprietary memory interface chip 
to buffer the addressing of the two-bank implementation. 

The memory and system bus controller chip is also a bus 
Converter and communicates with the system bus through 
two proprietary system bus interface chips. The system bus 
is a high-bandwidth bus capable of operation up to .'{.'i MHz 
(the maximum data transfer rate is 33 Mwords/s). Since the 
PBus is a multiplexed bus and the system bus has separate 
lines for the address and data signals, the primary function 
of the system bus interface chips is to demultiplex the PBus 
to Ihe system bus and vice versa. The built-in I/< >. graphics, 
and EISA interface all attach CO the system bus. 

The memory SIMMs are a proprietory design and do not rosemDIe the mduslry stanrlaid 
memory SIMM 



HP 9000 Series 700 Workstation 
Firmware 

The frrmware for the Series 700 workstations described m trie accompanying 
article includes what is formally known as processor dependent code (PDCl and 
1/0 dependent code IIODCI his a mixture of C and PA-RISC assembly language 
code, and it resides in two !28K-0vte EPHOMs on the system Board The firmware 
ryidges the gap between the machine hardware and the operating system soft- 
ware try providing basic machine .anon, and boowrapri r.r, 
services 

The Series 700 firmware is a close cousin of the Series BOO firmware. The overall 
design and architecteo software interface Iwitn a few additions) have been pre- 
served to minimize the impact on the primary users of POC. which include the 
diagnostics, the operating system loaders, and the operating system itself 

The special contributions of the Senes 700 firmware are 

• The addition of several new procedural entry points 

• A revised user interface suitable for PA-RISC workstations 

• A strategy for booting through the EISA interface 

for the Series 700. the primary user of the system may also function as the system 
administrator Several new features were added to the firmware user interface to 
make system administration tasks at the PDC level easy and intuitive Default boot 
and console devices can Be specified through friendly module names |e g., scsi, 
eisa. graphicsl rather than the traditional HP NI0/CI0 path notation Another 
special contribution of the Series 700 firmware functionality is an optional I/O 
scan for potentially bootable devices 

Several new procedural entry points were developed foi the Series 700 to pre- 
serve the external PA-RISC view of the 1/0 subsystem as seen through the PDC 
procedural interface Unlike Hewlett-Packard's proprietary CIO and NIO. the Series 
700 1/0 modules do not reside at fixed address intervals (slotsl in 1/0 space That 
is. Series 700 1/0 modules do not necessarily reside on successive 4K boundaries 
For the Series 800. the operating system uses this fixed position knowledge to 
convert a representation of an 1/0 module tin the traditional bus converter, physi- 
cal module, device dependent layer notation) to the corresponding start address 
Many ol the PDC entry points that provide auxiliary information about 1/0 modules 
require as an input parameter this hard physical address. Rather than change The 
PDC interface land force all callers of PDC to change also), a special new entry 
point PDC. MEMORY MAP provides the necessary translation of a logical desig- 
nation of an 1/0 module to the hard physical address Thus, software need deal 
only with the logical mapping ol 1/0 modules and need not derive or have a prion 
knowledge of Ihe actual physical address of the module 

A special strategy was derived to facilitate a boot through the EISA interface, 
independent ol ihe EISA configuration lile information inherent in most PC EISA 
boot models This strategy involved a definition for PA-RISC EISA I0DC (consisting 
ol architected I0DC with a special header and an extra entry paint) lor the card ol 
interest, a new PDC entry point. PDC..EISA, which provides machine-dependent 
information for the EISA I00C. and a cooperative scheme between the PDC and 
the operating system to allow a "cold" boot through EISA (before having run the 
on-line EISA configuration utilityl This strategy is not exclusive to the Series 700 

The Series 71)0 firmware was developed on HP 9000 Model 840 and 835 cornput 
ers using the PA-RISC simulator with extensions for 1/0 modeling A complete 
simulated boot of the firmware from power-up to initial program load was 
achieved before arrival ol first silicon Simulation greatly reduced the time needed 
to bring up the firmware on the actual hardware. 

Deborah A Savage 

Software Development Engineer 

Workstation Systems Division 
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Graphics 

The Model 720/730/750 system design is optimized for graph- 
ics performance. Workstation application performance was 
a fundamental objective ol' the project and graphics is a key 
component. As a result, special features have been added to 
the memory and system bus controller chip, and the system 
bus specification is customized for graphics applications. 

The memory and system bus controller implements a core 
set of functionality to optimize and thereby accelerate 
graphics performance on the system. The two main func- 
tions are interpolation of /. buffer and frame buffer data. 
Another core function is block move transfers between 
memory and the graphics hardware. 

The drive to create a "one box on the desktop" workstation 
also led to the creation of a low-c ost, one-board, graphics 
subsystem, which occupies the graphics slot configured in 
the Models 720, 7:10, and 750. The performance of this graph- 
ics option (C'RX) exceeded expectations at 1.15 million 21) 
or 3D vectors per second (8 bit color). An article on this 
subsystem is planned for a future issue of this journal. 

Built-in I/O 

< )ne of (he object ivcs for the I/O subsystem was to use an 
industry-standard I/O bus. A number of alternatives were 
considered, and a brief, spirited, period of discussion and 
analysis led to the selection of a bus from the PC world: 
EISA (Extended Industry Standard Architecture). A bus 
adapter design, using Intel's original EISA chipset, was 
Created to connect the EISA bus to the high-performance 
system bus. 

This selection of an industry standard bus (EISA) drove a 
strategy to make the PA-RISC version of the I1P-I X operat- 
ing system conform more closely to industry standards. This 
was done by modifying the operating systems I/O subsys- 
tem to provide a software interface for drivers that is much 
more like other vendors' systems. 

The functionality provided on the EISA bus now includes 
IEEE 802.3 LAN. FDDI LAN. IBM token ring LAN, X.25, SNA, 
fast/differential SCSI, serial multiplexers, and the IIP-IB 
( IEEE 4S8. IEC 625). A number of these cards and drivers 
are provided by independent vendors. An article on the EISA 
Card design is planned for a future issue of this journal. 

The core I/O subsystem, which is configured on the system 
I/O board and is independent of the EISA controller card, 
was selected to meet 80% to 90% of workstation users' needs 
without additional I/O cards. The system I/O board provides 
two serial ports, a bidirectional parallel port, an IEEE 802.3 
LAN interface, an I1P-HIL keyboard/mouse interface, and an 
SCSI interface. This robust offering of integrated I/O meets a 
wide variety of needs. The standard SCSI interface allows 
support for disks. DAT drives. CD-ROMs, and a PC-compatible 
flexible disk drive. 

The core I/O subsystem also benefited from the move 
towards a more open operating system. This competitive 
focus from the operating system and networking teams 



provided driver support for a system I/O board design based 
on the latest high-performance industry' VLSI controllers. 
The key components are new SCSI controllers from NCR 
(53(700, used on the system I/O board, and 53C710, used on 
the EISA card) and a new LAN controller from Intel (8255)6). 
These off-the-shelf devices interface efficiently and 
inexpensively to the high-performance system bus. 

The combination of a high-performance CIV. focused I/O 
driver development, high-performance industry I/( ) control- 
lei's, and an industry-standard bus has provided impressive 
I/O performance. The Models 720. 730. and 750 provide se- 
quential disk performance eight times greater than previous- 
generation products. NFS benchmarking (which uses net- 
working and disk I/O) shows that configurations of the Model 
750 can outperform dedicated NFS file servers. Additional 
details regarding the design of the core I/O subsystem can 
be found in the article on page 20. 

Performance 

The performance of the Models 720. 730 mid 750 is the result 
of a systems approach to creating a product that addresses 
the needs of workstation applications. Although the system 
clock rate is a factor in the performance numbers for the 
products, it is not the whole story. Directed design and tun- 
ing of the operating system, compilers, and subsystems, as 
discussed in this issue and the. June 1002 issue of tins journal, 
resulted in the industry leading performance numbers of the 
Series 700 workstations. 

( )ne of the workslat ion industry's standard measures of per- 
formance is a set of benchmarks called SPECmarks from 
the Systems Performance Evaluation Cooperative. This set 
of programs is run on the target machine and the individual 
results are combined to derive an overall SPEC 'mark for the 
target machine. Table I shows Hie IIP SPEC results for pow- 
er desktop products as of February 1". I!)5i2. along with the 
more basic MIPS (millions of instructions per second) and 
MFLOPS (millions of floating-point operations per second) 
ratings. Four of the SPEC benchmarks have little floating- 
point activity and six are floating-point -intensive. The per- 
formance ratings for these categories arc listed separately 
as SPECinl and SPECfp, respectively. 



Table I 

HP 9000 Series 700 SPECmark Performance 







Model 


Model 


Model 






720 


730 


750 


SPECmark 


(ret l) 


59.5 


76.8 


77.5 


SPECinl 


(rel. 1) 


39.5 


51.2 


51.5 


SPECfp 


(rel. 1 ) 


78.3 


100.6 


101.6 


SPECinfcBe 


(rel. 2) 


36.4 


47.8 


48.1 


SPECfp92 


(rel. 2) 


58.2 


75.4 


75.0 


MIPS 




57 


76 


76 


MFLOPS (double) 


17.9 


23.7 


23.7 
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Other performance measures used to compare workstations 
are those tied to graphics performance. Two such metrics 
measure those functions contributing to X Window perfor- 
mance (principally 2D) and the manipulation of forms in 2D 
and 3D graphics applications. The metrics for the CRX option 
of the Models "20. 73(1. and 7r>0 workstations are shown in 
Table II. 

Table II 

HP 9000 Series 700CRX Graphics Performance 

Model Model Model 
720CRX 730CRX 750CRX 

8.474 10.904 1U.904 
8.07!) 10,297 10.2117 

Thousands of 3D Vectors 
per Second 

820 1,090 1,090 
1.100 1.180 1.180 



Xllperf 1.2 Overall Operation 
Xllperf 1.2 General Graphics 

3D Vector Performance 
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VLSI Circuits for Low-End and 
Midrange PA-RISC Computers 

The major VLSI chips for the HP 9000 Series 700 workstations include a 
central processing unit with 577,000 transistors, a floating-point 
coprocessor with 640,000 transistors, and a memory and input/output 
controller with 185,000 transistors. 

by Craig A. Gleason, Leith Johnson, Steven T. Mangelsdorf, Thomas O. Meyer, and Mark A. Forsyth 



Processor and memory designs for today's low and mid- 
range engineering works! al ions and multiuser systems are 
constrained by many different, sometimes conflicting, re- 
quirements, High performance is critical for very large and 
computation-intensive user applications. Increasingly, cus- 
tomers expect more cost-effective solutions, requiring the 
right balance between cost and performance. In addition, 
product requirements dictate that processor and memory 
subsystems be compact, consume little power, and require a 
minimum amount of support hardware. The VLSI ( very 
large-scale integration) circuits described in this paper were 
designed to meet these requirements and to have the 



flexibility to be used in a number of IIP PA-RISC computers, 
including members of the HP 9000 Series 700 and 800 families 
and the HP 3000 Series 900 family. 

Overview 

The core processor and memory subsystem for these prod- 
ucts is shown in Fig. 1. The major VLSI chips include a cen- 
tral processing unit (CPU), a floating-point coprocessor 
(FPC), and a memory and system bus controller (MSBC). 
Collectively, these chips are called the PCX-S chipset. Key 
features of these devices are shown below: 
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Data Cache 



Memory Array Boards 




System ft— 




mBus 


Interface 


« n 


rface 



Control 



Memory and 
System Bus 
Controller 



TTL Buffer/ 
Multiplexer 



Data 



72 



« <► 



16 



16 



32 



16 



32 



16 



System Bus Address 



System Bus Data 



Control 



BankO 



Bankl 



Fig. i. Core CPU. cache, Qoatin&poinl coprocessor, and memory subsystem block diagram. 
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CPU and Cache Design 

All processor functions except for floating-point execution 
and cache memory storage are performed by the ( PC. This 
chip interfaces to both the FPC and the cache memory ar- 
rays. It also includes an interface to the memory and 1/0 
subsystem. The CPU implements HP's PA-RISC version 1.1 
architecture, which includes extensions to the original 
PA-RISC architecture focused on enhancing workstation 
performance. 

Because time to market was a key design focus, many fea- 
tures of the PCX-S CPU design were drawn from the PCX 
chipset used in HP's high-performance multiprocessing sys- 
tems. 1 The design was modified to accommodate a lower- 
cost system more closely aligned with the needs of the 
engineering workstation and low-end multiuser computer 
markets. Where Ihe previous design employed t wo-way set- 
associalive cache memories requiring VLSI support chips, 
the new chip is designed to interface directly to TTL-I/0 
SRAM chips. Tuning of Ihe system to operate at higher fre- 
quencies with faster SRAMs was also facilitated through I his 
change. The on-chip TLB was increased in size and made 
fully associative lo allow the removal of Ihe second-level 
TLB in the original design. New features and instructions 
I Including Ihe PA-KISt 1.1 extensions) were added I" mi- 
prove user Interface performance. All proposed changes 
were prioritized according to their schedule imparl and po- 
tential performance improvement, and those that offered Ihe 
besi ratio of performance improventcnl to design effort 
were chosen. Because of this schedule-driven choice of fea- 
tures and the high leverage from the previous chip, only 
fourteen months were required to reach first silicon. 

Physically, the CPC is divided into seven major blocks: two 
data paths, two memory managemenl units, a cache Control 
programmable logic array (PLA), a pipeline control PI.A, and 
an inpul/ouipul block containing Ihe various interfaces to 
cache memory storage and Ihe memory and system bus con- 
troller. Several small control and logic blocks composed of 
standard cells ;uul custom logic are also included. The design 
methodology used was "structured custom," where blocks 
are composed largely of library cells but are often hand tiled 
and include Custom circuils lo enhance performance. 

Phase clocks are used lo di\ ide each processor cycle inlo 
I wo halves or phases The ( PI ' includes three clock buffers, 
each fed by differential. BCL level sync inputs (see article, 
page 2:i). Tl uipuls ol all three buffers are connected on- 
chip to reduce clock distribution skew. Kach buffer oulpuls 



two complementary signals. CKl and CK2. representing the 
two clock phases. 

Most registers on the chip are const Dieted with transparent 
latches, which update their outputs while a dock input is 
true and hold the previous value while the clock input is 
false. These latches are more easily constructed in CMOS 
llian edge-lriggered latches. Another advantage of using the 
transparent latches is that in critical timing paths, the data 
tends to flow from one latch to another without watting for 
dock edges. This technique allows ihe system to run at high- 
er frequencies by spreading out timing budgets over multiple 
phases, rather than being limited by the slowest one-phase 
budget (as Lit an edge-triggered system). To avoid race con- 
ditions in queues of transparent latches because of overlap 
and distribution skew between CKl and CK2. local nonover- 
lapping clocks are created (see Fig. 'I). 

Each integer inst Diction executed by the machine is pro- 
cessed by the CPUs five-stage pipeline Five instructions 
can be processed concurrently, one in each of the v arious 
stages of execution. An instruction begins with a fetch from 
the instruction cache followed by a stage to decode the 
instruction. Branch and arithmetic stages follow, in which 
all arithmetic and logical operations are done, conditions 
are set, and branch targets are calculated. In Ihe fifth stage 
the inst Diction is completed by writing its result to a general 
register. 

Many architected registers are housed in the two data paths. 
A slack of 32 general-purpose registers acts as the first level 
of data storage. All integer operations use these registers for 
fast access to data. Load and store instructions are provided 
lo move data between the general registers anil memory. 
Kighl space registers are used lo hold the Hi-bit space identi- 
fier portion of the 48-bil segmented virtual address. Various 
control registers are included for interrupt handling, tempo- 
rary storage, data prolection. real-time operation, and other 
lasks. All of these resources are located in the Iwo dala 

paths. 

An interesting feature of ihe general registers is ihe addition 
of seven shadow registers lo aid in stale saving and restor- 
ing. Kach shadow register is located next to a companion 
general register. Anytime a trap occurs. Ihe operating sys- 
tem miisl save ihccurrciil state of the machine so I hat it can 
be properly restored once control is returned to Ihe user's 
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code. Normally (his would require many insinictions Co 
transfer the data from the general registers 10 temporary 
slorage or memory (and vice versa on returning control). All 
seven shadow registers can be updated in parallel while 
handling a trap, and likewise, the contents of the corre- 
sponding general registers can be restored simultaneously 
from their shadow registers on returning from the interrupt 

Execution Unit. Al the heart of the CPU is the execution unit, 
which handles arithmetic and data manipulation operations. 
II consists of an arithmetic/logic unit (ALU), a shift/merge 
unit (SMI') to perform shifting and bit-field extraction opera- 
tions, and various registers to feed these circuits with data 
and collect the results. Other processing resources include 
an adder for calculating branch target addresses. 

The ALL 1 is a high-speed, general-purpose arithmetic Unit 
performing addition, subtraction, and Boolean logic opera- 
tions. It includes logic to set conditions based on the resulls 
of instructions. These conditions can be tested to control 
branches) traps, and nullification. The ALL' is fed by two 
input buses. Most often the input buses are driven with data 
from the general registers, but in some cases the required 
data has not yet been written to the general registers and 
exists only in various pipeline regisiers representing the 
results of previous ALL' operations or loads. In these cases 
the data is bypassed around the general regisiers to the ALU. 
L'sing bypassing, an instruction can use the result of the 
previous instruction as one of its operands without causing 
the CPU to stall. A load followed by an immediate use of the 
load target as an operand will cause a single cycle penalty. 

Because branches are an important component of any 
instruction stream, an effective branch prediction algorithm 
is implemented. On any program-counter-relative branch 
(where a positive or negative displacement is added to the 
current program counter to calculate the branch target ). the 
CPU predicts that backward branches will be laken and 
forward branches will be untaken. Intuitively this is a good 
choice since most backward branches will be in multiple 
iterations of loops, while forward branches might be en- 
countered in case statements, for example. Correctly pre- 
dicted branches incur no pipeline penalty. An incorrectly 
predicted branch will cause a one-cycle delay. 

Cache Memories. No on-chip cache memory is provided, but 
the CPU includes full data, tag, and address interfaces to 
separate instruction and data cache memories. Each of 
these is composed of industry-standard, asynchronous, TTL- 
I/O static RAMs (SRAMs), providing a wide variety of cache 
speed and size options. Included in the cache interfaces are 
tag compare circuitry, parity encoding and decoding logic, 
store byte merging logic, and cache fill and copyback circuits. 

Given that SRAM technology is constantly improving, the 
system is designed to make the cache read cycle its critical 
path. With the 12-ns 256K-bit SRAM parts available in 1991 it 
is possible to build a 66-MIIz system with up to 25I5K bytes 
(each) of instruction and data caches. However, with the 
cache access path limiting system frequency, a growth path 
is left open to higher frequencies simply by substituting 10-ns. 
8-ns. and even faster SRAMs. An additional advantage of the 
SRAM-limited design is the opportunity to reduce the oper- 
ating frequency and save cost by using a lower-speed SRAM 



part (as in the 35-MHz IIP 9000 Model 706 workstation and 
the 32-MHz IIP 9000 Model 807 multiuser system ). 

The cache memories are Optimized for read timing, since 
writes occur less frequently in most code (remember, every 
instruction fetch involves an instruction cache read). Data 
cache fills take two cycles per double word, and data cache 
stores take three cycles, using a read-modify-write opera- 
tion. Compiler scheduling is used to eliminate penalties be- 
cause of conflicting loads and stores on the data cache. In 
cases where this is not possible, a store can result in a one- 
or-two-cycle penalty if either of the next two instructions is 
a load. 

While filling the cache on an instruction cache miss opera- 
tion, the CPU can execute the code as it is copied into the 
cache. This "cache streaming" effectively reduces the miss 
penalty by up to eight cycles. A similar algorithm is used on 
the data cache- 
To provide the fastest possible cache access, a virtual ad- 
dress is used to index the SRAMs (this allows the CPU to 
access the cache before it has done a virtual-to-rcal address 
translation). The tag contains the real address of the data 
stored in each 32-byte cache line, as well as a "dirty" bit to 
indicate that the location has been modified and a "private" 
bit for multiprocessor support, Each processor must obtain 
a private copy of a cache location before modifying that loca- 
tion in the cache. The instruction and data caches are direct- 
mapped, again to speed up the cache access and to reduce 
the CPU pin count. To reduce bus traffic in multiprocessor 
systems, a write-back memory update policy is implemented. 

A cache access is initiated by driving an address to the 
SRAMs (see Fig. 3). Data will return (asynchronously) 
roughly one cycle later, depending on the speed of the ad- 
dress drive and the SRAM access time. The data is collected 
by transparent latch receivers on the CPU. The liming of the 
latch clock for the incoming data is critical. If the latch 
clock falls too early, data will not be valid; too late, and a 
fast SRAM can change its data before the receiver input is 
shut off. This latch dock is created by inverting the CPU 
clock that initiates the cache access and delaying it by 
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Fig. 4. i IT chip. 

means of a long printed circuit board trace. Simulations and 
characterization data were used 10 choose Ihe correct latch 
clock timing lor reliable, high-performance operation. 

Translation Lookaside Buffers. Virtual-! o-physical memory 
translation is performed by the instruction and daia transla- 
tion lookaside buffers ( TLB). A TLB contains 00 fully asso- 
ciative entries, each mapping a IK-byle page of memory. 
Each TLB also implements four "super-TLB," or block en- 
tries. These are used by software to map large contiguous 
areas of memory (up to 10.YI bytes), which would normally 
require many single-page TLB entries. One common use of 
the block entries is in mapping Ihe graphics frame buffer. 

On an instruction fetch a virtual address is issued to the 
instruction TLB. Kach entry is checked to see if it contains 
the real translation and protection information for thai ad- 
dress. In the case of a match, the real address is driven by 
the instruction TLB and compared with the real address 
of the incoming instruction from the cache lag. A similar 
process is used in the dala TLB in the case of a load or store. 

To perform a fully associative access in a 66-MHz cycle, 
the TLBs make use of several interesting circuits. Typically, 
a fully associative memory is composed of a content- 
addressable memory (CAM), which holds the "address" of 
each entry, and a random-access memory (HAM), which 
holds the data associated with that address. In Ihe case of 
the TLB. Ihe ( AM holds the virtual address of a page (space 
identifier and Virtual page number), and Ihe RAM holds Ihe 
real page number, protection identifier, access rights, and 
other information specific to Ihe entry. The virtual address 
for the current instruction or data access is driven to the 
CAM and compared with the address contained in Ihe entry. 
If there is a match, the ( AM causes Ihe associated RAM 
entry to drive its data out of the TLB. 

The TLB is required to Complete its access ill one phase 
(half cycle) so that Ihe real address can be driven to Ihe 
cache interface before the data or instruction returns from 



the c orresponding cache access. Because of the two-part 
nature of the access ( CAM/RAM I. it was necessary' to create 
a special timing clock lo complete the CAM access and initi- 
ate Ihe RAM access. This is done by building a dununy CAM 
entry that always "hits' and fires its match line. Care is taken 
in the chip layout to ensure that this entry' always evaluates 
more slowly than the others. The dummy match line is buff- 
ered to create the midphase clock that will initiate the RAM 
access. If there is a match in any of the ( "AM entries, the 
clock will cause the select line for its RAM entry to be driven 

Entry replacement in the TLB is handled by an algorithm 
that provides performance similar to a least recently used 
policy while requiring only a fraction of the overhead. Only- 
one bit per entry is used. The first entry whose replacement 
bit is zero is chosen for replacement, and following Ihe op- 
eration its bit is set to one. If all replacement bits are ones 
the first entry is chosen and all other replacement bits are 
set to zero. 

HP's high-performance CMOS26 technology was chosen for 
the CPL' fabrication. CMOS26 offers three levels of alumi- 
num interconnect and low-resistance silicided polysilicon 
and diffusion layers. Typical effective gale lengths are 0,86 
micrometers for n-channel transistors and 0.88 micrometers 
for p-channel transistors. Fig. 4 is a photomicrograph of the 
CPl" chip. The chip is packaged in a 408-pin multilayer PGA 
(pin-grid array) originally designed to support the PCX chip- 
set. The PGA was modified for the PGX-S chip lo reduce 
cost and reflect lower power dissipation (8 warts at 66 MHz). 
Multiple power and ground planes are used in Ihe package 
to reduce inductance and support the high-speed Switching 
Of Over 200 outputs. Power supply bypass capacitors are 
attached directly to the ceramic substrate of Ihe PGA to 
minimize noise. 

Floating-Point Coprocessor 

The FPC chip contains Ihe hardware for executing floating- 
point insi rtiCl ions. Fig. "> is a block diagram of this chip 
and Fig. 0 is a photomicrograph. The FP( ' chip includes an 
FALL dala path, an FMI'V dala path, a large register file, ten 
instruction pipeline registers, and a gate array for control 
logic. The basic design of this chip has been described in a 
previous paper.* 

The FI'C is lightly coupled lo Ihe ( I'l and caches. Il con- 
nects directly lo the instruction cac he SRAMs. so il can de- 
code insi ructions as soon as they are fetched rallier than 
wailing for the CPU to tell it what operation needs to be 
performed. To maintain synchronism, the PPG and the GPU 
keep identical copies of the processor pipeline. The FPC's 
direct conneclion lo the data cache SRAMs helps minimize 
lalenc\ on load and store operations. 

Maximizing concurrent execution of insi ructions was an 
important goal in the FPC design. There are two fully Inde- 
pendent functional units: the fall ami the FMI'V. The 

FALl ' can be performing add or sublracl operations while 
Ihe FMI'V performs unrelated multiplies, divides, and square 
toots. To minimize processor pipeline stalls, the FI'C has a 
tWO-entry queue for instructions that cannot execute im- 
mediately because of dala dependency or resource conten- 
tion. I'p to ten instructions can be simultaneously pending 
inside the FPC. 
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Fig. 5. Floating-point c-u]>ri>" i -ssrii' lilm k diagram 
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These features help the FPC achieve a peak throughput of 
66 MFLOPs al f>(> MHz. This is very important for vectoriz- 
able applications, which have a large amotinl Of potential 
parallelism to exploit On the oilier hand, scalar applications 
have very limited parallelism potential, and for these ap- 
plications, instruction execution latency is more critical for 
performance. The FPC has much Improved latency com- 
pared to previous PA-RISC implementations. Simple add. 
subtract, and multiply operations execute in only three 
cycles. Divides require 12 cycles and square roots require 
18 cycles. 

Architectural Enhancements. A cross-functional team was 
empowered lo identify Opportunities to improve system per- 
formance through instruction set extensions. This team had 
representation from the processor, graphics, compilers, per- 
formance analysis, and architecture teams. Suggested exten- 
sions were validated by analyzing graphics and numerical 
applications, and hardware and software costs were 
considered as well. 

The enhanced PA-RISC architecture is called PA-RISC 1.1. 
Enhancements to the core PA-RISC floating-point instruction 
set include: 

• The number of lid-bit floating point registers was increased 
from 1G to 32. This was done to maximize the effectiveness 
of compilation techniques such as loop unrolling and soft- 
ware pipelining. These techniques can expose the concur- 
rency present in much floating-point code, but only if the 
number of registers is sufficient to allow the concurrent 
computations lo be interleaved. 

• Both halves of each (id-bit register can be addressed as a 
•'S2-bii (single-precision) register. This effectively increases 
the number of registers for single-precision operations to (id. 
Moreover, a single 64-bit load or store instruction can be 
used to load or store pairs of single-precision values, pro- 
vided they are contiguous and aligned in both memory and 
the register file. These improvements are particularly 
important for graphics code, which makes heavy use of 
single-precision operat ions. 



• It is common for floating-point coprocessor implementa- 
tions to have at least two independent functional units, one 
for mull iplies and one for adds and subtracts. Two new 
instructions, FMPYADD and FMPYSUB. were added to execute 
independent operations in these two functional units in par- 
allel In other words, a single FMPYADD or FMPYSUB instruction 
can specify both a multiply and an add or subtract operation. 
This allows many multiply and add or subtract Instructions 
lo be fused into a single instruction. 

• An unsigned integer multiply instruction was addetl which 
operates on registers in the floating-point register file. 
Floating-point multipliers can easily perform this operation. 




Fig. 6. floating-point coprocessor chip. 
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and i! is much faster than the formerly used millicode 
routine for multiplying nonconstant integers. 

The remaining enhancements are implementation depen- 
dent. These are used only in product dependent code like 
dynamically linked graphics libraries and the operating sys- 
icm. and are nol available directly in application binaries. 
This ensures that applications will run on future machines 
that omit or modify these extensions. 

An inverse square root ( 1/ vx) instruction was added. This is 
frequently used by graphics code (in coordinate transforma- 
tions, for example), and it is more than twice as fast as 
sequential square root and divide operations. 
Graphics clipping operations were accelerated by adding a 
12-bit queue of comparison results and a few simple instruc- 
tions to test conditions within the queue. This doubled the 
speed of the clipping operation for the most important 
"trivial accept" and "trivial reject" cases. 
A quad-word ( 128-bit ) store instruction was added. This 
allows pairs of (54-bit floating-point registers to be stored in 
five cycles instead of the six cycles it takes using two sepa- 
rate Mores. Tliis is important because many codes (like 
iiu'inory-io-meniory copy) are limited by load/store 
bandwidth. 

Memory and I/O Subsystem 

For typical application workloads. CPU efficiency is of pri- 
mary importance. C'ache-lo-nteinory transfers dominate the 
system architectures contribution to overall performance. 
The memory subsystem is optimized for low CPr-to-memory 
latency and bandwidth to match CPU requirements. 

I/O capabilities must be sufficient to support high- 
performance networking and disk transfers, and must be 
flexible and powerful enough to support easy attachment of 
a wide range of devices. Transfer latencies should be mini- 
mized, and moderate-bandwidth DMA should be supported. 
I/O bus protocols should be simple and use standard signal 
levels for ease of access. 

A high -performance user Interface is the hallmark of work- 
stations. High-speed transfers between I he ( 'PI ' or memory 
and the user interface provide I he basis for good windowing 
graphics performance. Main-memory Z-buli'er capability and 
interpolation support in the memory controller yield high- 
performance, low-cost MI) graphics. 

Basic Architecture. The above requirements led to the block- 
diagram shown in Fig. I. The CPU is directly connected to 
the memory and system bus controller by a 32-bit full- 
frequency bus (PMus). The memory subsystem width is 128 
bits plus 16 error detection and correction bits. Standard 
TTI. is used to multiplex this down to 72 bits into the 
memory and system bus controller. On the other side of the 
memory and system bus controller is the system bus. a non- 
multiplexed, ."12-bil -address. :(2-bii-daia proprietary design. 

Three ASICs (application-specific integrated circuits) were 
designed for the memory and system bus eoni roller subsys- 
tem. The main control ASIC, the memory and system bus 
controller chip (Fig. 7), is housed in a 272-pin PGA. The sec- 
ond ASIC is the system bus interface chip. Two of these 
Iti-bit-wide. custom buffer parts create the system bus. They 
are packaged in KM) pin plastic QFPS (quad flalpacks). The 



third ASIC decodes memory addresses and buffers the ad- 
dress and control lines. It is housed in a (58-pin plastic QFP. 
One buffer pan is used for each 18 DRAMs. The number in 
the system depends on the memory configuration. 

Memory and System Bus Controller Functionality. The memory 
and system bus controller supports standard 80-ns DRAMs 
of several different densities. Cache miss latency to the first 
word is 155 ns. and the peak transfer rate is 2(50 Mbytes/s. 
Two 72-bit-wide banks are multiplexed and accessed in page 
mode to provide the necessary bandwidth. 

An error detection and correction scheme is employed to 
control soft errors inherent in DRAM technology. An eight - 
bit code is calculated from t>4 data bits. The code used is 
capable of correcting any single-bit error, detecting any 
double-bit error, and detecting any four-bit burst error 
resulting from a single DRAM failure. 

Memory write granularity is M bits (72 with error correction 
code), Therefore, any write of less than a double word must 
use read-modify-write cycles. Since cache transfers ;ire al- 
ways aligned on 32-byte blocks, read-modify-write cycles 
are not necessary for CPU-related transactions. 

The protocol of the system bus is fundamentally a single 
word per transfer. A 4- word-deep buffer is used to gal her 
DMA writes into double-word writes when possible, thus 
avoiding read-modify-write cycles and achieving good 
performance. 

A single 32-byte buffer is used for temporary storage of cast- 
out cache lines on their way lo memory. This buffer is con- 
trolled such thai pieces of up to two lines can be present 
during periods of high flushing activity. 

A single 32-byte instruction prefetch buffer is also imple- 
mented. Upon an instruction miss, the missing line is 
fetched, followed by the next line, which is placed in the 




Fi«. 7. Memory and system bus controller chip. 
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buffer. If the next instruction miss hits on the huffcr, its con- 
tents are passed on to the CPU and the next 32 bytes are 
fetched from memory. 

User Interface Functionality. Several features were added to 
the memory and system bus controller to improve user 
interface performance. 

User interface adapters connect to the memory and system 
bus controller via Ihe system bus shown in Fig. 1. This is a 
nonmultiplexed bus with separate 32-bit address and data 
paths. The separate address provides sufficient address 
bandwidth for the nonblock-orienled nature of some user- 
i nt erf ace-re lat ed t ransfers. 

Windowing and certain graphics functions benefit from high- 
speed transfers of data from the CPU to the user interface 
adapter. The CPU and the memory and system bus controller 
are optimized to perform these writes as quickly as possible. 

User Interface (auctions also tend to benefit from high- 
speed movement of data between the adapter and main 
memory. Special instructions and hardware were added to 
the CPU and the memory and system bus controller to move 
blocks of data to and from the user interface hardware and 
memory. It was intended that this functionality be available 
at the user level. As such it is cache coherent and uses virtual 
addressing. 

Graphics functionality is a part of the user interface that is 
frequently valuable in workstation applications. Hardware 
to support main-memory Z-buffer. Z. and color interpolation 
are part of the memory and system bus controller. Good 3D 
rendering performance can be achieved with low-cost, simple 
user interface adapters. 

Miscellaneous Functionality. EISA is the expansion bus for 
Series 700 workstations. A special mode is incoiporated in 
the system bus to improve EISA transfer bandwidth. 

The memory and system bus controller is the system bus 
host. In this role it performs arbitration, error management, 
interrupt management, and a few other housekeeping tasks. 

Verification. The memory and system bus controller was the 
last piece of the system to enter the design phase. Unlike 
most other pieces of the system, there was little leverage 
from previous design efforts. Therefore, it was imperative to 
achieve high-quality silicon on the first release. 

A great deal of effort was devoted to verification of the de- 
sign. A checker program was developed in parallel with the 
ASIC models. This checker was used for stand-alone chip and 
system-level simulations in conjunction with an extensive 
suite of test vectors to verify system behavior. 

First silicon was functional, and the operating system 
booted on the system w ithin 36 hours of receipt of wafers 
from fabrication. All three memory and system bus interface 
ASICs were released to production with only minor metal 
mask revisions. 

Technology and Architecture Choices 

RISC VLSI design teams are faced with an abundance of 
choices, ranging from selection of basic core technologies to 
Tine tuning of hundreds of implementation details to give the 



best performance on the expected customer applications. In 
making these decisions, designers must consider technology 
and market trends, core competencies and resources of the 
design team, customer requirements, schedule and cost 
goals, and scalability of the design to meet future require- 
ments. Given this complexity, it is not surprising that differ- 
ent design teams come up with radically different solutions 
to meet the needs of similar products. 

Basic design decisions for the PCX-S chip set described in 
this article were made by carefully evaluating alternatives 
against system-level cost, performance, and schedule goals. 
Inputs from manufacturing, technology partners, VLSI and 
system designers, architects, compiler and operating system 
designers, performance analysts, and customers were all 
considered in the initial investigation stage. Some of the 
more important design decisions and trade-offs made are 
discussed in the following paragraphs. 

PA-RISC Architecture Extensions 

A primary goal of the original PA-RISC (version 1.0) archi- 
tecture definition was to support scalability across all types 
of computer applications and performance requirements. 
Before the design of the PCX-S chip set. most PA-RISC im- 
plementations had focused on meeting Ihe needs of transac- 
tion processing, commercial, and general-puipose multiuser 
applications. Since PCX-S was to be an implementation also 
Optimized for workstation applications, an extensive analy- 
sis of typical customer workstation applications and their 
associated performance bottlenecks was conducted early in 
the project (see "PA-RISC Performance Modeling and Simu- 
lation," page 21 ). The results of this study suggested that a 
small number of critical extensions to the original architec- 
ture would provide significant benefits to the customer. The 
evolved version of the architecture containing these exten- 
sions is known as PA-RISC version 1 . 1 and is fully object- 
code compatible with applications developed for version 1.0. 

The architecture extensions targeted three primary areas 
of opportunity: improved performance on floating-point- 
intensive technical applications, improved performance on 
graphics-intensive and X-windovvs applications, and improved 
performance and efficiency of the virtual memory subsystem 
for lower-cost implementations. 

VLSI Technology Selection and Partitioning 

The design team quickly narrowed the list of VLSI technolo- 
gies that could simultaneously meet all of Ihe project goals 
to just one: high-speed, high-density CMOS technology. 
While the performance goals could have been easily met 
with ECL technology, the power, cost, and density penalties 
incurred would have limited low-cost applications of the 
chip set and future scalability of Ihe design. Gallium arse- 
nide not only had ECL's drawbacks, hut also had not yet 
been fully proven in a high-volume microprocessor 
manufacturing environment. 

The time-to-markct goal also played an influential role in 
technology selection. Meeting the aggressive schedule re- 
quirements for this design required extensive leverage of 
existing technologies, designer expertise, tools, circuit 
libraries, and verification suites. 
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For the CPL" chip, the design center selec ted was a previous 
PA-RISC processor design. 3 which was first fabricated in 
early 1989. This design had been optimized for larger com- 
mercial and technical multiuser systems, including the HP 
9000 Model 870 and HP 3000 Series 980 computers. The pro- 
cessor was fabricated in HP's 1.0-micrometer. three-level- 
metal. CMOS26 production technology. In addition to the 
CMOS26 technology, much of the implementation design, 
circuit libraries, and Verification suites, and the multilayer 
ceramic package design were leveraged to decrease time to 
market for the new CPL' design. 

The floating-point coprocessor posed a more difficult chal- 
lenge for the design leant. Since the previous-generation 
CMOS processor had been designed to interface with higher- 
cost and higher-power ECIi-technology floating-point units, 
direct leverage of this design was ruled out. It was also de- 
termined that the available resources could not support de- 
sign of both the CPC and the coprocessor on the aggressive 
target schedule. Fortunately, a semiconductor partner with 
high-performance CMOS floating-point technology already- 
well along in the design phase was found to codevelop the 
PCX-S floating-point coprocessor. Time to market was im- 
proved by separating the CPU and coprocessor on different 
chips, while industry-leading performance was achiev ed by 
adopting a tightly coupled design approach (floating-point 
coprocessor interfaced directly to the instruction and data 
caches) and designing matched, low-skew clock generation 
circuits on both chips. 

Like most workstation designs, the memory and I/O inter- 
face functions are implemented on a separate IC. Tin- CPl ' 
communicates with this chip over a high-speed synchronous 
bus, which is functionally and electrically identical to that 
used in other IIP PA-RISC systems. This allowed I he proces- 
sor to be prototyped in existing systems, improving the 
schedule in I he evaluation phase, and made the new design 
compatible with a variety of other memory systems opti- 
mized for different types of applications. Like the CPC and 
floating-point coprocessor designs, the memory interface IC 
design used a structured custom approach. Among other 
benefits, this allowed the inclusion of new features such as 
embedded graphics acceleration logic. 

High-Frequency Design 

Although many design techniques were used to improve 
performance of the processor on customer applications, 
none was considered as critical as designing I he chips for 
high-clock-frequency operation. Clock speed increases result 
in across-the-board performance improvements on nearly all 
compute-hound applications, while architecture and logic 
features generally benefit a smaller set of specific application 
types. 

There are three basic techniques that are commonly used to 

increase integrated circuit clock speeds: 

I "sea faster VLSI fabrication technology 

Sort or bin parts to select only those at the faster end of the 

speed distribution 

Improve the efficiency of (he design by reducing logic de- 
lays, eliminating speed limiting features, and incorporating 
hand-tuned custom circuit designs and layout in critical 
liming paths. 



The first two techniques were rejected for the CPl* chip 
design because of increased cost, schedule risk, and a more 
limited growth path. The third method, which emphasizes 
high-speed design techniques over manufacturing techniques, 
not only has the advantage of reduced cost over the other 
two alternatives, but also has an easier growth path to high- 
er frequencies with future technology scaling, since more 
efficient circuit designs will maintain their speed advantage 
in fester teclinologies. 

The designers had the advantage of availability of proto- 
types of a previous design from which the clock circuits, 
CMOS cell library, and core integer unit blocks were lever- 
aged. Detailed characterization of this device over the speci- 
fied environmental, operating, and processing conditions 
pinpointed all of the speed limiting paths and established a 
baseline operating frequency in excess of 50 MHz. By fur- 
ther liming of critical paths and detailed worst-case simula- 
tions of new features it was determined that the new CPU 
design could achieve a clock speed greater than 60 MHz for 
a significant portion of the yield distribution. This speed 
llien became a design goal and all new features considered 
were first simulated and tuned to determine if they could 
achieve the same frequency in sufficient quantities to meet 
anticipated production needs. First silicon prototypes were 
carefully analyzed and modified to improve all speed limit- 
ing paths, with the final result being 100 percent of the the 
yield distribution operating above the 66-MHz goal. The cus- 
tom, w-orst-case design approach used was found to have up 
to a twofold speed advantage in some cases over more con- 
ventional scmicustom design methodologies. High design 
productivity was maintained by resorting to handcrafted 
custom circuits only in critical liming paths and using Other 
methodologies elsewhere. 

Cache Memories 

Cache memory organization and speed ate also critical lo the 
resulling performance, cost, and schedule of a processor de- 
sign. In fact, there are many memory-intensive applications 
in which the performance of the cache memory can play a 
stronger role in determining application response time than 
the speed or pipeline efficiency of the core processor. This 
is becoming even more true as clock speeds rise, instruction 
pipelines become more efficient, and "typical" customer 
applications grow in size and memory requirements; 

After extensive analysis the design team chose to implement 
separate Instruction and data caches, both of which use a 
direct-mapped organization and are composed of industry- 
standard SRAMs external lo the CPL' and floating-point co- 
processor chips. The more conventional approach of imple- 
menting the cache memories directly on the CPC chip w as 
considered bul rejected for a number of reasons, some of 
which are outlined below. 

Performance. Larger applications gem-rally require access lo 
larger memory working sets, which translates into much 
higher cache miss ratios for a given cache size. Kach cache 
miss requires a fixed amount of overhead time to service, 
thereby reducing effective sustained performance. On-chip 
cache memories, because they are limited to much smaller 



August liniJ IIi-wIi-ii I'lU'karfl .louniul 19 



© Copr. 1949-1998 Hewlett-Packard Co. 



sizes than can be implemented with external memories, suf- 
fer significant performance degradations as the size of ap- 
plications increases. Simulations showed thai a single pri- 
mary cache of moderate size would outperform a small 
on-chip cache on real-world applications, even when the 
latter was hacked up by a larger, external, second-level 
cache. 

Frequency and Scalability. An often-quoted reason for the 
necessity of on-chip cache memories with high-performance 
RISC' CPUs is that clock speed or future scalability potential 
are limited by using external SRAMs. After reviewing SRAM 
technology trends and Simulating new cache liming designs 
it was found that, in fact, the data indicated a somewhat 
different conclusion. First, the speed of available SRAM 
technology at a given point in time has historically exceeded 
(and will probably continue to do so) the requirements of 
the fastest processors built in a similar generation of fab- 
rication tec hnology. The rate of improvement of these de- 
vices is actually increasing because of increased competi- 
tion and expanding markets. SRAMs are increasingly using 
VLSI technologies optimized for high-speed, high-density 
memory circuits to their advantage. Prices of competitive 
devices are decreasing exponentially (at rates of up to 30% 
every (i months). Second, generally speaking, most previous 
external cache memory designs were limited by timing over- 
head because of nonoptimal clocking and package designs, 
not by the speed of available SRAM technology. Overhead 
times (the portion of the cache cycle lime over and above 
the access time of the memory ) of 10 to 15 ns or more were 
found to be typical on competing designs evaluated by the 
team, effectively cutting in half the potential clock frequency 
possible with the available SRAM technology. By focusing 
on improving these bottlenecks instead of adopting a radi- 
cally different cache architecture the PCX-S design achieved 
both an industry-leading clock frequency and reduced cost 
of external SRAM components because of less aggressive 
access time requirements. The resulting design achieves a 
processor cycle time just 3 ns greater than the access time 
of the SRAMs employed. 

Manufacturability. Time to market and manufaet inability are 
also enhanced with an external primary cache design. The 
VLSI CPU chip is generally the most complex and expensive 
IC component in a system. Addition of large numbers of 
transistors to implement caches on this die can drive up 
costs and decrease manufacluring yields significantly. Also, 
fabrication technologies that are optimized for high-speed 
logic circuits are often less than optimal for implementing 
high-density memory circuits and vice versa. 

Instruction Execution Pipeline 

As a rule, competitive microprocessor designs use pipelined 
execution of instructions. The details (how each instruction 
is split between functions, number of stages, etc. ) vary con- 
siderably between designs, but all have the same basic goal 
of exploiting parallelism to allow instructions lo be issued 
and completed at a rate faster than they could he if unpipe- 
lined. The performance of different pipelines varies widely 
as a function of the efficiency of the implemenlation. The 
peak performance of a pipeline measures only the maximum 



rate at which Instructions can be dispatched under ideal 
conditions, while the average or sustained performance in- 
cludes nonideal events such as pipeline stalls because of 
code branches or hardware resource contention, measured 
while running real applications. Optimizing the latter was 
the goal for the PCX-S processor design. 

Both a conventional RISC pipeline and a superscalar imple- 
mentation were considered as viable options in I he design. 
Superpipelining was another technique included in the study 
for comparison purposes, although it can be maintained thai 
this approach is just an extension of conventional pipelining 
rather than a new pipeline implementation technique. Tech- 
niques such as branch prediction, data bypassing, cache 
streaming, and prefetching were first applied aggressively to 
the design to inc rease pipeline efficiency Also considered in 
the design decision were the impacts of choices on clock 
speed, cost, and time to market. The design team chose a 
conventional pipeline implementation that includes a num- 
ber of features to improve overall efficienc y as the best 
suited for the performance, price, and schedule goals. The 
techniques used to increase the speed and efficiency of this 
pipeline are also designed to be fully applicable to future 
superscalar implementations. 

Recent attempts to classify RISC processor designs based 
only on such features as their pipeline implementation miss 
many of the critical elements that determine the actual com- 
petitiveness of the designs. In reality, much more basic ele- 
ments such as circuit design techniques, clock speeds, 
cache memories, main memory latency, and so on are far 
more critical in determining basic competitive attributes 
(cost, performance, time to market ). 

Future Growth Path 

To maintain competitiveness in the high-performance arena. 
RISC designs must offer continual performance and cost 
improvements as the basic enabling technologies evolve. 
The industry trend for performance growth has historically 
been a doubling of high-end performance levels every 12 to 
18 months. To maintain this level of improvement on a regu- 
lar basis with a realistic level of investment in design re- 
sources and new technologies requires a significant amount 
of leverage between successive generations of designs. 

In making the fundamental design trade-offs for the PCX-S 
processor, the impact of each decision on future scalability 
was an important criterion. Some of the more important 
aspects of the design that support scalability are: 
• High-Frequency Circuit Designs. By using efficient circuit 
designs to achieve high speeds rather than pushing technol- 
ogy requirements to the limit, room is left to allow quick 
speed increases through technology scaling. The CPU design 
uses a technology featuring 1.0-um drawn transistor channel 
lengths lo achieve speeds in excess of GO MHz. At this time, 
technologies using 0.8-um geometries are widely available 
and 0.5-um technologies are on the horizon. Algorithmic 
scaling of designs into faster fabrication technologies can 
allow significant speed improvements without requiring 
extensive redesign. 
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PA-RISC Performance Modeling and Simulation 



The increasing^ competitive computer market requires mat we continue so provide 
the highest level of performance while offering competitive price/performance 
Performance is fractionally measured with industry-standard benchmarks such as 
unpack, the SPEC suites, and TPC-A As important as Benchmarks are. they are 
often too simplistic to reflect the customer s actual workload To delrver superior 
pertormancs to tr>e customer it is no longer sufficient to nase architectural and 
processor impiementaron decisions M inaustry-standaid uencltmaris alone 

Hewlett Packard has invested m the development of specialized nardware and 
software instrumentation to aid m the analysis of computer system behavior under 
actual customer workloads With product offerings in both instruments and com- 
puters. HP is highly qualified in computet systems instrumentation Today we Have 
the capability to characterize workloads fully to any level of detail, from operating 
system call graphs to pipeline interlocks Hardware instrumentation can analyze 
CPU. cache, memory, and I/O behavior nonintrusively The resulting data from 
actual customer workloads is used to drive detailed CPU models The models have 
proved extremely valuable in quantifying the potential performance gams from 
instruction level parallelism, specific processor implementation decisions, compiler 
scheduling algorithms, and operating system policy changes 

The design of the PA-RISC processor used In the HP 9000 Series 700 workstations 
was heavily influenced by information collected from customer workloads. The 
cache-tag emulator, which consists of specialized cache behavior monitoring 
hardware, was used to characterize the performance behavior of several alterna- 
tive cache designs. Information obtained from instruction traces collected from 
customer workloads was used to guide numerous pipeline trade-offs This m- 
foimation was used in analyzing branch prediction policies, instruction streaming 
during cache fill, stall on use. and instruction cache line prefetching. Instrucuon 
traces were also used to analyze several alternative TLB designs Organization, 
replacement policy, page size, and software management policies were thoroughly 
analyzed The final design incorporates a fully associative organization based on 
replacement policy that is an approximation of LRU (least recently usedl Page size 
was increased to 4K bytes, improvements were made to reduce the number of 
cycles needed to update the TLB, and new software management policies were 
developed to improve overall performance Use o' actual workload data enabled 
the processor design team to improve performance dramatically while substantially 
reducing complexity 



This approach has helped to identify the set of design trade-offs that provide the 
best value for the customer It has helped PA-RISC implementations steer clear of 
several pntalis plaguing some processor designs An example of the influence of 
-nsirumemation on design decisions is m the area of high-degree superscalar 
machines. While in theory these machines can provide substantive performance 
improvement in practice this s often not achieved Failure to consider actual 
customer workload behavior when considering superscalar Designs can lead to 
exceptionally complex designs that yield little Benefit to the customer Worse still, 
the comolexity of tiiese designs often limits the clock rate and future performance 
scaling The recently announced suoerscalar implementation of the PA-RISC archi- 
tecture' was cieated only after careful analysis of actual customer workloads 
Based on this analysis a minimal set of superscalar features was identified that 
provided the maximum benefit to the customer Limiting superscalar capability to 
this minimal set also minimized complexity and development time An additional 
benefit is that it allows a high clock rate, which is crucial to performance 

It is critically important that the performance analysis used in design decisions be 
accurate This is guaranteed through continuous verification ot perfoimance pro- 
jections The projections are compared with actual measurement results Any 
discrepancies cause careful review of the models to identify the source of the 
error and make the necessary corrections Use of this process over the past five 
years has enabled us to predict the performance of multiuser systems consistently 
within 5% For computational systems the accuracy has been somewhat better, 
generally falling in the l%-to-2% range 

In the future, computing environments will become more complex. The adoption ol 
client/server configurations and the widespread use of multiprocessors are chal- 
lenges to instruments and models. To address these new environments we have 
been identifying customer workloads to instrument and are developing instrumen- 
tation to handle multiple systems This will guarantee continued selection of 
design trade-offs that give the best value to the customer 
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Cache Memory Timing Techniques. Emphasis on improving 
I he performance of packaging and off-chip cache liming 
paths also promotes future scalability. The (30-MHz version 
of I he chip requires external SRAM cycle limes of only 12 ns 
lo operate over I he specified environmeiilal and operating 
Conditions* Cycle limes of S ns are available in off-the-shelf 
components today; these would allow much higher proces- 
sor clock speeds. Furthermore, advanced mullichip module 
packaging technology, which is not used in Ihe present de- 
sign, offers further opportunity to increase the speed of 
off-chip paths by reducing chip crossing delays and clock 
skews. 

Integration Levels. Adopting a mullichip design thai did not 
push the limits of chip densily or die si/.es enhanced both 
manufaci inability and time to market Kill lire designs si ill 
have Significant room for integration of additional features 
both lo reduce cost and to increase performance. Because 
of ihe organization and partitioning of the CPU and Qoating- 
point coprocessor chips, ii is a relatively straightforward 
process lo integrate the two functions onlo a single die. for 

instance. 

SRAM Technology Growth Path, The Chip sel was designed 
lo support external cache Memory sizes I lo 8 limes larger 



Mian those used in the presenl Series 70(1 systems without 
requiring redesign of any of Ihe VLSI components. This en- 
ables future system designers to lake advantage of Ihe in- 
dustry SRAM learning curve?, which has resulted in expo- 
ncniial price reductions and a quadrupling of component 
densities every few years, in addition lo regular speed 
improvements. 

In addition, rapid advances m compiler technology, graphics 
hardware and algorithms, architectural features, and DRAM 
technology will also contribute to maintaining exponential 
system performance increases in the years to come. Many of 
these performance growth COhCeptS have, in fact, been in- 
corporated into an evolved version of Ihe PCX-S (TI ' called 
the PA 71DII. 1 This chip was first sampled less than one year 
after introduction of the first PCXS-based systems. 
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ECL Clocks for High-Performance 
RISC Workstations 

In the HP 9000 Series 700 workstations, clock signals are distributed 
using differential ECL circuits, and the VLSI chips have CMOS inputs 
operating at ECL levels. Critical clock delay signals are routed on 50-ohm 
striplines on printed circuit board inner layers. 

by Frank J. Lettang 



In synchronous computers Ihe clock system generates the 
rhythm that makes everything work, so clock performance 
plays an important role in determining Ihe maximum oper- 
ating frequency of a high-performance workstation. Clock 
design and distribution are important system design issues 
thai separate semifast computers from extremely fast 

computers. 

To make an effective clock system it is important lo mini- 
mize both clock jitter and skew since both effects reduce 
the lime available for the computer to do useful work be- 
tween each clock tick. Component variations, electrical 
noise, temperature, and supply voltage changes all combine 
to increase the amount of jitter and skew found in a clock 
system. The clock system design must minimize these 



problems, not only in ihe clock circuils themselves but 
also in the components that use the clocks. 

In HP 9000 Series 700 workstations, high-speed clock signals 
are distributed using ECL logic because of its superior tim- 
ing characteristics and faster rise and fall times. To improve 
the performance, nearly all the ECL signals are routed dif- 
ferentially from Ihe outputs of low-skew clock distribution 
ICs. Using both Ihe signal and its complement to drive the 
differential inputs of an ECL device greatly reduces the ef- 
fects of differences between the rise and fall times. CMOS 
differential inputs operating at ECL thresholds are used on 
all ihe high-speed VLSI chips. This minimizes on-chip delay 
and skew by making the conversion to internal CMOS levels 
more efficient While all of this may seem a bit expensive, 
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Looks like 50 ohms lo 3V 



Needs to be as short as possible. 
Fig. 2. Passive differential level I raiislal ion circuit. 

Ihe performance of the clock system affects the perfor- 
mance of all of the devices I hat connect to it, and spending a 
Little more money on the clock systeni Saves a lot of money 
in other areas. 

Fig. 1 is a block diagram of the Scries TOO clock system. The 
system stalls with an ECL oscillator running ai twice the 
processor frequency. This oscillator drives the clock input of 
an ECL finite state machine ihai generates differentia] ECL 
clock signals at Ihe processor frequency and half the proces- 
sor frequency. These signals need a specific phase relal ion- 
ship. The slale machine always reverts to Ihe proper transi- 
tion pattern without the need for a resel signal. From the 
state machine, differential ECL signals drive clock 
distribution chips and ECL-tO-TTL converters. 

The state machine is implemented using a number of ECL 
high-speed edge-triggered Hip-flops. Since all Ihe flip-Hops 
are on the same die. skew and jitter are minimized. The os- 
cillator has a frequency stability of 11)0 ppm. which conlrib- 
utes only about 2 ps lo the clock system jitter. Oscillators 
with good frequency stability arc essential lo designing high 
performance clock systems. As clock speeds increase beyond 
that of Ihe current Series 700 workstations, providing yood 
frequency references will become an even more important 
challenge. For Ihe current clock system Ihe maximum mea- 
sured combined jitter and skew is about 75 ps and is largely 
the result of noise and component behavior. 

A differential ECL level translation circuit is needed be- 
tween the clock slate machine, which uses ECL referenced 
to :? volts, and the clock distribution chip used by the VLSI 
chips, which is referenced to 0 volts. This eight-resistor net- 
work (Fig. 2) properly terminates the complementary ECL 
signals at the characteristic impedance of Ihe slriplines thai 
carry them (60 ohms) while generating the input signal 
needed by the ECL clock distribution IC. Since the output 
impedance Of this nelwork is relatively high compared to 
whal can be achieved on the printed circuit board, the pas- 
sive network is located physically as close as possible lo the 
clock distribution chip. 




Input is a 600-ps-rise-tirne pulse 



Output is a slightly distorted, 
delayed and attenuated pulse. 
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Fig. .3. Tlme-domaiti response ofthe passive differential level 
translation ciri nit. 

I T sing an HP -Vl 121T lime-domain network analyzer, the re- 
sponse of this circuit to input pulses is easily verified. The 
response is shown in Fig. :i. The printed circuit board imple- 
mentation generates an edge delay of about 298 ps and de- 
grades the 700-ps ECL rise time slightly. This is largely Ihe 
result ofthe parasitic capacitance and inductance present in 
any real implementation of a level translation nelwork. Il is 
always useful to verify designs of this type with actual 
measurements and check for unexpected parasitic effects, 

The high-speed system bus uses TTL clocks running at half 
Ihe processor frequency. These clocks are generated by an 
ECL-io-TTL convener that doubles as a TO clock distribu- 
tion chip. A differential ECL delay line, implemented as a 
pair of slriplines, is placed between Ihe slale machine and 
this convener lo optimize setup and hold limes on ihe sys- 
tem bus. ECL signals are ideal for realizing highly accurate 
signal delays because of I heir crisp edges and high-quality 
terminal ions. No level translation is required on this path 
since Ihe convener expects :t \( ill -referenced ECL and thai 
is whal the clock stale machine generates. 

Clock signal quality and noise immunity are maximized by 
fouling critical clock delay lines on whal are nominally 
50-ohm stripline geometries Implemented on primed circuit 
board inner layers. The quality of these delay lines is easily 
verified using the HP 5-I121T as a lime-domain reflect omeler. 
Fig. 4 shows a typical measurement. 




Fig. 4. TIM; measurement of an untersrinaled -l.l-ns clock system 
delay One 
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Conclusion 

By carefully engineering the Series 700 clock system to mini- 
mize undesirable jitter and skew, valuable cycle time is used 
as efficiently as possible. Additionally, the clock system im- 
plements controlled delays designed to optimize performance 
of the system bus. Clock design directly translates into prod- 
uct performance, so spending extra design effort in this area 
is extremely cost-effective- 
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HP 9000 Series 700 Input/Output 
Subsystem 

Integrated on a single 8.5-by-1 1-inch I/O board is hardware support for 
the SCSI, the Centronics parallel printer interface, two RS-232 ports, the 
IEEE 802.3 LAN, the HP-HIL, four audio tone generators, and a real-time 
clock. An application-specific IC serves as I/O subsystem controller. 

by Daniel Li and Audrey B. Gore 



In today's environment of ever-increasing CPU performance, 
il is critical that I/O subsystem performance keep up with 
GPU performance, If I/O subsystem performance cannot 
keep up with CPl" performance, the system will become I/O 
bound and will not benefit from increased CPU performance. 
The goal in designing the I/O system for the HP 9000 Series 
700 workstations was to design a balanced high-performance 
system with many built-in features and yet still keep the 
system cost low. 

To increase performance, the core I/O subsystem is attached 
directly to the high-bandwidth, pipelined system bus. The 
high-speed I/O devices, such as the SCSI (Small Computer 
System Interface) and the Ethernet local area network (IAN), 
perform DMA (direct memory access) data transfers to and 
from the system memory with very low latency. This not 
only greatly reduces the chance for LAN and SCSI control- 
lers to overrun their internal buffers, but also minimizes the 
use of available system bus bandwidth and frees bandwidth 
for graphic devices and I/O expansion slots. 

A low-cost CMOS ASIC (application-specific integrated cir- 
cuit ) chip called the I/O controller implements the logic that 
controls the interface between the I/O subsystem and the 
system bus, thereby minimizing the need for interface logic. 

I/O System Features 

In the HP 9000 Series 700 workstations, a set of I/O function- 
ality is integrated on an 8.5-by-l 1-inch system I/O board. The 
following is a list of the built-in I/O system features: 
SCSI with DMA scatter/gather capability 
Parallel interface with DMA capability (bidirectional with 
HP ScanJet support ) 

Two high-performance, asynchronous RS-232 pons 

Ethernet LAN with DMA capability 

HP Human Interface Loop (HP-HIL) 

Four audio tone generators (internal and external 

capabilities) 

Two 12SK x 8-bit ROMs containing self-test. boot, console 
handler. CPU, and I/O firmware code 
Real-time clock with lithium battery backup 
8Kx 8-bit EEPROM nonvolatile memory. 

I/O Subsystem Overview 

Fig. 1 is a block diagram of the I/O subsystem. Inside the I/O 
subsystem there are two buses: a local data bus and an 



address bus. All functional blocks are located between these 
buses and use some portion of the local data bus. Depending 
on its specific requirements, a given block may or may not 
use some portion of the address bus. 

The core I/O subsystem is attached to the system bus. Data 
communication passes through a 32-bit bidirectional tristate 
register. I/O addresses pass through a 30-bit bidirectional 
tristate register, making the I/O subsystem capable of 
performing master DMA operations. 

If necessary, byte addressing during DMA operations is done 
to get into word alignment or to finish a transfer that does 
not end on a word boundary. The I/O controller chip auto- 
matically handles word alignment 

A set of bidirectional tristate buffers (74245s) attached to 
the local data bus and to the system bus interface data regis- 
ters makes word assembling and data byle disassembling 
possible. 

Bus Controller Interface 

Except for the interrupt request signal, the signals needed 
by the core I/O subsystem to interface to the memory and 
system bus controller chip are all defined in the system bus 
specification. The interrupt is asserted by the I/O controller 
chip on behalf of the devices inside the core I/O subsystem. 
The interrupt is deasserted after a GPU read to the I/O 
controller chip's interrupt register. 

Direct Memory Access (DMA) 

There are three bus masters that use DMA on the I/O subsys- 
tem board: the LAN. the parallel interface, and the SCSI. 
Inside the NCR53C700 SCSI controller, there is a bus master 
DMA device which is capable of moving data between disk 
and system memory at the rate of 27.7 Mbytes/s. This as- 
sumes the NCR chip is running at 33 MHz. has a burst size of 
24 bytes, and has an arbitration overhead of 13 system bus 
clock cycles. The Intel 82596 LAN controller also has a built- 
in high-performance DMA controller. Inside the I/O control- 
ler chip, a 32-byte FIFO register and a DMA channel support 
bidirectional parallel printer interface applications, such as 
the HP ScanJet. 

The I/O controller chip uses the system bus request and bus 
grant signals on the system bus to gain access to the bus for 
each device. If multiple bus requests occur simultaneously. 



26 AngUSI 199? ll.-wlell-FiK-k.-inl Journal 

© Copr. 1949-1998 Hewlett-Packard Co. 



Memory and 
System Bus 
Controller 



System Bus 



32Data[310] 

-9 • f- 



8 

, £7 0] 



Data Byte Steering 


Byte 




Byte 




Byte 




Byte 


and Bidirectional 


Address 0 




Address 1 




Address 2 




Address 3 


Data Buffers 


74646 




74646 




74646 




74646 




[7:0] * 






A !3i2i: 



30Address:31 2\ 

• • 9 



i i 



Control Latch 



74543 




74543 


1 


74543 


1 


74543 


1 


74543 










T 






Address 
Latches 



RS 232 »3 
(Domain 
Keyboard) 



Fig. 1. Ill' !M»'M» Sr-rifi 70(1 I/O subsystem block diagram 



August 1992 Hewlott-Pacterd Journal 27 



l Copr. 1949-1998 Hewlett-Packard Co. 



the I/O controller chip will arbitrate access for one bus 
cycle, allowing each requesting device to master the bus 
according to its priority. 

SCSI 

The SCSI (Small Computer System Interface) is a system- 
level interface bus used to connect disk drives, tape drives, 
and other I/O devices to a computer system. Numerous 
workstations today support this bus standard, and it is 
becoming the de facto disk interface standard. 

The Series 700 I/O subsystem will support the SCSI II speci- 
fication. Currently, it supports the 8-bit data bus, running 
single-ended at 5 Mbytes/s. 

The NCR53C700 intelligent SCSI controller chip is used in 
the core I/O subsystem. On the host bus side, it has an on- 
chip 32-bit DMA engine and a "script processor," which 
fetches its own commands and performs SCSI transactions 
with minimal host processor intervention. A small portion of 
the I/O controller chip is used to implement the logic that 
controls the interface between the NCR53C700 and the 
system bus. 

A concent about the SCSI compared to device-level inter- 
faces is the amount of latency the SCSI control logic adds to 
the total subsystem overhead. The Series 700 SCSI subsys- 
tem is designed to keep this overhead to a minimum. The 
script processor inside the NCR53C700 minimizes SCSI I/O 
stall latency; it takes only 500 ns to begin, compared lo 2 to 
8 ms for a traditional SCSI controller. The NCR53C700 can 
make decisions based on phase changes on the SCSI bus 
and compare specific data values. This minimizes the num- 
ber of interrupts to the processor, which may take more than 
several hundred microseconds to execute and cari be a large 
source of performance loss. 

In the HP-UX* operating system, a disk data transfer may be 
broken up and data buffers may be scattered throughout 
system memory. The latency to reinstruct the DMA opera- 
tion can result in a missed disk revolution. The performance 
degradation resulting from this data scattering is minimized 
by the scatter/gather feature of the NCR53C700 chip. 

The combination of the fast system bus transfer rale, the 
fast SCSI bus transfer rate, and the efficient architecture of 
the NCR53C700 chip enables us to achieve a high disk I/O 
transfer rate without the need to place a large private buffer 
between the SCSI controller chip and the system bus inter- 
face. This not only lowers the system cost but also avoids 
the complexity and latency of managing the buffer, thus 
maximizing the disk I/O throughput. 

Local Area Network (LAN) 

The Series 700 workstations implement a built-in LAN that 
conforms to the IEEE 802.3/Ethemet standard. The LAN 
circuitry consists of the Intel 82596DX-82C501AD chip set, 
plus a transceiver chip and associated circuitry. The Intel 
82596DX is an intelligent, high-performance 32-bit LAN con- 
troller. The 82C501AD device provides the electrical inter- 
face to the transceiver cable (AL"I or built-in Cheapemet 
MAU). generates a 10-MHz transmit clock for the LAN con- 
troller, and performs Manchester encoding and decoding of 
the transmitted and received frames. 



The 82596DX has large on-chip FIFO buffers, 128 bytes for 
receive and 64 for transmit. It also provides a four-channel 
DMA controller to communicate directly with the system 
memory via the high-performance system bus interface. The 
low memory access latency and the large on-chip FIFO prac- 
tically eliminate overrun and underrun without using an 
external FIFO or dedicated packet buffer memory, 

The 82596DX bus interface is optimized for the Intel386 
microprocessor bus. The similarity between the system bus 
and the Intel386 bus made the control circuitry to interface 
the two extremely simple; for the LAN-specific portion of the 
I/O controller chip, the total gate count is less than 100. Un- 
like some older-generation controllers, the 82596DX system 
clock rate is asynchronous to the 10-MHz Ethernet clock so 
that the controller, the I/O controller chip, and the system bus 
all run at the same frequency. For DMA operations, the I/O 
controller chip arbitrates system bus access on the 8259GDX's 
behalf, manages the address valid/ready handshake, and 
controls the address and data buffers. The I/O controller 
chip also controls access to the 82596DXs CPU port. 

The four-channel DMA controller manages memory struc- 
tures automatically using command chaining and bidirec- 
tional data chaining. This allows autonomous block data 
transfers and greatly reduces the CPU overhead. The four 
channels are: CU (transmit header). TX0 (transmit data), RU 
(receive header), and RXD (receive data). 

A permanent copy of the LAN station nodal address is kept 
in location 0 of the nonvolatile RAM (EEPROM). The on- 
board processor dependent hardware status register has 
three bits for LAN connector status: one for MAU power/ 
fuse and two for ThinLan/ThickLan selection; it also has 
three bits for the SPU ID. 

The hardware design is IEEE 802.3 compliant and Ethernet 
revision 2 compatible. A removable jumper selects either the 
ALT or BNC connectors. The LAN heartbeat is indicated by a 
front-panel LED to facilitate diagnostics. 

Bidirectional Printer Interface 

A majority of comput ers communicate with output devices 
such as printers and plotters via the Centronics parallel in- 
terface. Centronics is an 8-bit parallel, synchronous inter- 
face with additional control signals from the host computer 
and status signals from the peripheral. 

The Centronics interface was defined many years ago. With 
the passage of time, the demand for higher transfer rates 
motivated some companies to modify the Centronics stan- 
dard so that it transfers data faster in a stream mode with- 
out the overhead of a shared handshake for each data byte 
transferred. Some products' Centronics interfaces do not 
have overlapping strobe and busy handshake signals. Yet 
another variation of the Centronics interface is the bidirec- 
tional HP ScanJet interface. In the output mode, this inter- 
face is compatible with the Centronics standard. However, 
in the input mode, the peripheral becomes the master and 
the busy and strobe signals change direction. 

The Series 700 parallel interface is designed to support all 
these variations of the Centronics interface. Additional func- 
tionality is implemented by special logic built into the I/O 
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c ontroller chip, and the Western Digital \VDluC552 chip is 
used at the parallel port as the device driver/ receiver 
interface. 

The Series 700 I/O subsystem supports automatic hardware 
handshaking at the parallel interface, which removes the 
burden from software and improves performance. However, 
the software is capable of controlling all handshakes directly. 

The Series 700 hardware also adds DMA capability to the 
parallel interfac e to inc rease the performance. Inside the I/O 
controller chip, a parallel-port DMA controller and a FIFO 
buffer are implemented. The FIFO buffer supports 32-byte 
inbound and ;J2-byte outbound data transfers. The hardware 
is capable of a 380-kbyte/s data transfer rate. 

Serial Channel Communications 

There are two B§-232-C serial ports in the I/O subsystem. 
They are fully compatible with the National NSHiSoOA chip. 
Host conununicalion to the serial pons and the status and 
control registers is done through the I/O controller chip. 
Each serial port uses a 16-byte inbound and a 16- byte 
outbound FIFO data buffer to increase efficiency. 

The interface supports a 19.2-khaud inbound data transfer 
rate with no data loss, using only software XON/XOFF flow 
control. Up to 230.4 kbaud inbound is supported with no 
data loss using hardware flow control. Hardware flow con- 
trol is implemented by additional logic and controls the RTS 
line to the peripheral, thus preventing data overrun errors in 
the input FIFO buffer or input holding register. This feature 
is intended for use with high-speed serial devices that are 
capable of quickly suspending serial data transfers to the 
host when the RTS line is dropped by the host interface 
controller hardware. 

Processor Dependent Hardware 

The processor dependent hardware includes the boot ROM, 
EEPROM nonvolatile memory, status switches, and status 
LEDs. as well as the 8042 slave 1 subsystem, which consists of 
the real-time clock, audio generator, and 1IP-IIIL. 

Two sockets are available for boot ROM. Initially, each sock- 
el can hold a 128K x 8-bit ROM, but the sockets are wired so 
that they are also capable of holding ")12K-hyle pails if nec- 
essary. Thus, there are 256K bytes of boot R( >M available, 
expandable to a maximum of 1M bytes. 

The I/O subsystem has an 8K x 8-bil EEPRI )M which may be 
used for storing system configuration status and any other 
alterable, nonvolatile information. The manufacturer Of the 
EEPROM guarantees reliability of at least 10 years with a 
maximum total number of write cycles of 10.000 for any 
given byte. 

The 8042 subsystem is taken from the HP Apollo Series 400 
workstations and is composed of several devices: battery- 
backed real-lime clock, system timers, user liiners, audio 
generator, and HP Human Interface Loop (HP-HBL). These 
devices are controlled via an Intel 8042 slave microproces- 
sor which acts a server for these devices. Access to the de- 
vices is through the 8042 command/data protocol under con- 
trol of i he I/O controller chip, The HP nil. supports eight 

input devices (keyboard, mouse, tablet, elc. ) wilh a single 
•..\stem connector. 
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Fig. 2. I t ) controller ASIC chip bloc* diagram. 
Porch Board 

Since there are so many I/O connectors, a double-height 
bulkhead is needed to hold them all. The porch board is the 
means by which the signals gel to the lop level of the bulk- 
head. An 80-pin connector carries signals from the system 
board up to the porch board. The I/O connectors on the sys- 
tem I/O board are SCSI II, LAN ALU, and LAN BNC. The I/O 
connectors on the porch board are HP-HIL. audio. Centron- 
ics, and the two RS-232 pons. Other pails on the porch 
board are for ESD protection and EMI prevention. 

I/O Controller Chip 

The I/O controller chip is a 15,000-gate ASIC controller for 
the I/O subsystem. It interfaces with the memory and system 
bus controller chip and the system bus. Besides control 
(Unctions, it contains registers for subsystem Interrupts, a 
DMA channel for the parallel printer interface scan path 
logic, some parallel printer interface related logic including 
a :52-byte bidirectional FIFO buffer, system bus address de- 
coders, and other miscellaneous registers. The I/O controller 
c hip is housed in a KiO-pin PQFP (plastic quad Hat package.) 
wilh a large number of power and ground pins to minimize 
Simultaneous switching noise. TWO pins on the I/O controller 
chip are designated for scan-chain testing. The I/O controller 
chip also provides a Irislale lesl mode, which facilitates 
board production testing. In the Iristate test mode, all of the 
outputs and bidirectional pins of the I/O controller chip are 
iristated (in a high-impedance state, instead of being driven 
high or low). This lest mode helps isolate the I/O controller 
chip while testing other parts on the board. 

Fig. 2 shows the I/O controller chip block diagram. The I/O 
controller chip is I he heatl of the I/O subsystem. Its main 
functions are I/O subsystem address decoding, bus arbitra- 
tion, interrupt control, peripheral device handshaking, and 
data flow control, The I/O controller chip interfaces wilh Ihe 
memory and system bus controller chip, the system I/O con- 
troller, and the I/O subsystem devices. The I/O controller 
chip is designed lo work at frequencies from 25 to 33 Mil/ 
wilh no modifications. It provides the following resources 
and functions: 
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System bus interface and arbitral ion 

Address and daia bus transceiver control 

Western Digital serial/parallel chip control 

DMA channel for parallel printer interlace 

Intel 82596DX LAN chip control 

\ri;.->:!< Too SCSI chip control 

Intel 8042 control UlIMIIL, audio, real-time dock I 

Processor dependent hardware control (ROM, KEPROM, 

status registers, LCDs) 

Interrupt registers. 

Arbitration. The need for arbitration arises because the ad- 
dress and data buses for SCSI, LAN. and the I/O controller 
c hip are multiplexed together on the I/O system board. The 
address buses of these devices are connected together di- 
rectly. For the data buses, the I/O controller chip's 8-bit data 
bus fust goes through the steering logic (74245s) and then 
connects with the SCSI and LAN data buses. This 8-bit data 
bus is also multiplexed with the Western Digital chip 
( WD16C552) and other I/O devices. 

System Bus Interface. The system bus interface block of the 
I/O controller chip combines the graphics interface and byte 
packing functions. For master (outgoing) transactions, the 
system bus interface is responsible for enabling and clock- 
ing the external address and data buffers and the I/O con- 
troller chip address out drivers, and for managing the sys- 
tem bus address valid/readv protocol. It also handles error 
conditions. 

For slave (incoming) transactions, the system bus interface 
block detects the start of transactions to the core I/O system 
and generates the external signal for latching addresses in 
the external buffers. This is one of the most critical liming 
paths in the I/O controller chip and the core I/O system. The 
system bus protocol and tight timing make detecting and 
latching new valid core I/O addresses difficult. Furthennore. 
the address must be held steady for the length of the trans- 
action and released in lime to look at new addresses. The 
system bus interface decodes the upper eight bits of the 
incoming address CO determine if the current transaction is 
to the core I/O system. The system bus interface also con- 
trols byte packing in the core-I/O-to-system-bus direction 
and byte unpacking in the system-bus-to-core-I/O direction 
for t he 8-bil devices in t he core I/O system. 

Address Decoder. The address decoder serves two main func- 
tions: selecting slave units by decoding incoming system-bus 
addresses, and synthesizing address bits 1 and 0 for devices 
that need them. 

In general, only as much decoding as necessary is done to 
determine which unit is being addressed. Consequently, 
there is much aliasing. The Upper eight address bits do not 
come into the address decoder. These are used instead by 
the system bus stale machine, which generates a master 
select enable selecten. To pres ent select signals from being 
generated from spurious or inv alid addresses, all the de- 
codes are ANDed with the selecten signal. Since the address is 
latched externally, it is guaranteed valid and stable for the 
entire lime selecten is asserted. 

Interrupt. Inside the L'O controller chip there are an interrupt 
request register (IRR) and an interrupt mask register OMR) 
similar to the interrupt structure of the FA-RISC architecture. 



An interrupt pending register ( ll'R) is also provided. All Of 
these registers appear to be 32 bits wide and are accessed as 
such. However, only the 15 least-significant bits are Imple- 
mented for each register. The remaining bits are nol affected 
by Writes and are always read as zeros. 

The possible sources of the defined interrupts are: 

• NMI from EISA 

• 8042 general interrupts 

• 8042 high-priority interrupts 

• Reserved 

• Reserved 

• WD16C552 SKi l 

• WD16C552 SIO 2 

• WD16G552 parallel printer interface 

• LAN 

• SCSI 

• EISA 

• Graphicsl 

• Graphics2 

• SIO 

• Domain keyboard. 

The interrupt pending register (IPR) is used to latch incom- 
ing interrupts and indicate thai they are pending. The exter- 
nal interrupts are synchronized in the IPR and an active 
edge on any synchronized signal causes the corresponding 
IPR bit to be set to 1. 

The Interrupt mask register ( I.MR) is a read/write register 
used to mask pending interrupts; A 1 in an IMR bit enables 
the corresponding pending interrupt lo create an interrupt 
request 

External devices must assert interrupts for just over two 
CPU clock cycles (one I/O subsystem cycle) to be synchro- 
nized and detected. The interrupt must also deassert for at 
least two CPU clock cycles (one I/O subsystem cycle) for 
the next assertion to be recognized. 

SCSI. Because of liming and loading constraints on the sys- 
tem bus, part of the I/O controller chip is used to implement 
the control logic- between the NCE53C70Q SCSI controller 
and the system bus. The SCSI interface block primarily han- 
dles the control signals while the system bus interface con- 
trols the address and data buffers between the system bus 
and the NCR chip. 

For the address and data path, the address buffers (74543s) 
are in transparent mode for SCSI master transactions and 
in latch mode for slav e transactions. The data buffers 
(74646s) are in transparent mode for master and slave w rite 
transactions and in latch mode for master and slave read 
I ransact ions. 

LAN. Part of the I/O controller chip is also used to implement 
the control logic between the Intel 82596DX LAN Controller 
and the system bus. The LAN subbloek controls Ihe Intel 
82596DX. The slave subbloek controls slave operations 
(pon and channel attention) and master subbloek interfaces 
to the system bus on master operations. 

A slave mode transaction is one that is initialed by the pro- 
cessor. The Ihree slave mode transactions to the 82596DX 
are channel attention, port select, and hard reset. 
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When the S2696DX wants the bus for DMA master transac- 
tions, it asserts HOLD. The I/O controller chip LIOARB Mock 
arbitrates among Ihe core I/O masters (LAN. SCSI, and Cen- 
tronics DMA) and requests the system bus. The I/O control- 
ler chip then asserts HLOA to the 82596DX and lets the 
8259GDX keep the bus as long as it wants to (until HOLD is 
deasserted). The I/O controller chip also enables the address 
buffers from the core I/O system to the system bus. 

Parallel Port and DMA Thi \V. stem Digital WD16C552 chip is 
used at the parallel port as the device driver/receiver inter- 
face. The Centronics compatible features and the additional 
functionality including that required by the HP ScanJet 
products are controlled through the I/O controller chip. 

The parallel ports DMA controller inside the L'O controller 
chip is really a bidirectional data accumulator that masters 
transactions on the system bus. Its main purpose is to better 
match the relatively low-bandwidth parallel port to the high- 
performance system bus and give the port direct access to 
memory without affecting CPU performance very much. It 
buffers data for both the read and write functions of the 
built-in parallel port II collects data from the system bus 
and sends it to the parallel port and sice versa. The system 
bus connection is a 32-bit-wide bus and the parallel port 
connection is S bits wide, so in addition to dealing with the 
high-speed system bus interface the DMA controller also 
does byte packing. 

I/O Verification 

The l/< ) verification strategy was divided into two parts; a 
Stand-alone I/O system test using a tool called the bus exer- 
ciser, and a complete system test using the actual CPU, 
memory and system bus controller chip, and memory mod- 
els. The stand-alone verification Strategy will be discussed 
here. The full-system verification strategy is covered in the 
article on page 34. 

The bus exerciser is a tool that verifies the correctness of 
the I/O subsystem design in I he behavioral simulation envi- 
ronment. The bus exerciser is written in the MADI. descrip- 
tion language (see "Simulation Toolset." page 36). The bus 
exerciser is a kind Of substitute for the CPl' and Ihe memory 
and System bus controller chip ill thai il executes lest pro- 
grams and generates corresponding transactions on the 
system bus to stimulate the I/O subsystem and check the 
correctness of the response from Ihe I/< ) subsystem. 

There were three main reasons for using Ihe bus exerciser 
lo verify Ihe I/O subsystem design. First, the CPU, memory 
and system bus controller chip, and meniorv models were 
not available in the early phases of the I/O controller chip 
design. I 'sing Ihe bus exerciser eliminates the need for stable 
working models of Ihe CPU and the memory and system bus 
controller chip before testing the rather independent I/O sub- 
system. Second. Ihe bus exerciser models Ihe ( IT, Ihe inem- 
or\ ;n n I system bus controller chip, the KISA card, and Ihe 
memory subsystem in a simplified fashion. Thus .simulation 
in the bus exerciser environment is about ten times faster 
than simulation with the (TV and memory and system bus 
controller chip models. Third, when running with Ihe bus 
exerciser, we do not see cache misses and TI.15 misses. All 
the I l.MA data setup and result checking are done through 

the bus exerciser^ virtual hardware In one clock cycle rather 

than using thi- ('IT model running PA-RISC Instructions to 
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Fig. 3. Block diagram of the bus exerciser used for stand-alone I/O 
subsystem verification. 

do data setup and result checking. As a result, we can issue 
much more intense and simultaneous I/O activities on the 
system bus, Which helps stress the I/O subsystem. 

Fig. 3 is a block diagram of the bus exerciser. There are five 
main sections: 

• Instruction array (1(10 instructions deep) 

• Memory ( IK bytes) 

• System bus controller 

• Self-checking logic 

• Stress lest state machines. 

Instruction Array. To simplify Ihe design, the bus exerciser 
implements an array that holds up to 10(1 instructions. The 
Instruction array is initialized through guide vectors before 
the simulation begins. The bus exerciser can execute 1-, 2-, 
or 4-byte reads and 1-, 2-, or 4-byte pipelined or nonpipelined 
writes. 

Bach Instruction in the instruction array Is SO bits wide. A 
simple programming language was developed lo simplify 
test development The program contains several fields. The 
first field specifies whether the operation is a read or a 
write. The second field specifies the address from which lo 
read or lo which CO write. The next field Specifies the data 
expected Oil a read ar the data to write on a write operation. 
A fourth field [nigh! be used on a write operation to specify 

pipelined mode. An example or the programming language is 

as follows (the lines beginning with » are eomments I: 



August l!>!eil™| 1 .||-l'arl<u| 1 |.l..iini;il 31 



© Copr. 1949-1998 Hewlett-Packard Co. 



##### INTERRUPT TESTIint.instD 
I Read interrupt pending register 
r f080000B 0 

* Read interrupt mask register 
r (0800004 0 

# enable interrupt mask 
w (0800004 f(fftfrf p 

# master clear 
wb (082(000 0 

* read interrupt mask reg 
r (0800004 0 

This programming language is translated lo guide vectors 
using a simple HP-UX shell script. The resulting file contains 
80 bits of information: 

Hits :il:0 store the data information and bits 63:32 store the 
address Information. Bits 79:7(5 are the opcode, such as read 
word or write byte. Bits 75:72 give pipeline information for 
the write operations. Bits 71:68 give information on whether 
or not to compare the read data. On a read instruction, if Un- 
expected data is known, then that data can be included in 
the data field (bits 31:0). 

The table below further describes the instruction bits: 
inst.[79:76) 

0000 : read word 
0001 : write word 
0010 : read byte 
0011 : write byte 

0100 : read 2 bytes 

0101 : write 2 bytes 

inst[75:72] 

0000 : no-pipeline write 
0001 : pipeline write 

inst[71:68) 

0000 : no-read data compare 

0001 : read data compare 

Listed below is an example of the Bret five entries of an 
instruction array corresponding to the sample program 
above. 

input busex/inst[0] 0 %h001 0(080000800000000 
input busex/instll] 0 %h0010f080000400000000 
input busex/inst[2) 0 % h 1 1 0Of 0800004tf ffff ff 
input busex/mst[31 0 %h3100(082f00000000000 
input busex/inst|4j 0 %h001 Of 080000400000000 

Memory. Inside the bus exerciser model is a memory array 
of 1024 entries by 32 bits. This is provided mainly because 
the SCSI and LAN models need memory to perform their 
DMA operations. The memory block shared between the 
LAN and the CPU is also hardwired into the memory model 
to minimize the tedious IAN DMA setup. 

System Bus Controller. The system bus controller is the 
"brain" of the bus exerciser. The bus controller is imple- 
mented in a large, complex state machine. The bus control- 
ler fetches and executes instructions from the instruction 
array according to the system bus protocol. The bus exercis- 
er is the bus master and the I/O controller chip is the slave 
while instructions are being fetched and executed. Instruc- 
tions are fetched and executed until the instruction array is 
empty or until an I/O device requests the bus. When an I/O 



device requests the bus. the bus exerciser becomes Ihe slave 
and the I/O controller chip is the bus master. 

Listed below are the system bus functions Supported by the 
system bus controller. Some functions were described in Ihe 
"Instruction Array" section above. In this context, "host" is 
Ihe bus exerciser and "guest" is the I/O system controller. 

• One-byte, two-byte, or four-byte host -generated pipelined 

write 

• One-byte, two-byte, or four-byte host-generated nonpipe- 
lined read/wrile 

• One-byte, two-byte, or four-byte guest-generated pipelined 
read/write 

• One-byte, two-byte, or four-byte guest -general ed nonpipe- 
lined read/write 

• Interrupt handling 

• Error generation and error handling 

• Bus arbitral ion. 

Self-Checking Logic 

Extensive self-checking capability was added to the bus 
exerciser model to minimize human Intervention. For 
instance. Ihe test program can specify the expected data for 
each read Instruction. Upon completion of Ihe read transac- 
tion, the bus exerciser will check the returned data with the 
expected data and raise an error flag if an inconsistency 
occurs. 

For LAN, SCSI, and Centronics DMA operation, the bus ex- 
erciser will remember Ihe dala transferred from the memory 
lo Ihe I/O devices and laler perform self-checking while Ihe 
previously transferred dala is looped back from Ihe I/O 
devices lo memory. 

Stress Mode. Normally a test will be written in ihe bus exer- 
ciser programming language, convened lo guide vectors, 
and run on the I/O system model. The lests are relatively 
short since Ihe address queue can only hold up to 100 instruc- 
tions. These short lests are usually used to test a particular 
device, such as the HP-HIL. LAN. or SCSI 

However, situations arise in which we would like to use the 
bus exerciser lo generate I/O events randomly lo stress the 
I/O subsystem over a long period of lime. The bus exerciser 
uses stress mode lo run randomly generated programs thai 
can run for any specified length of time. The goal is for Ihe 
bus exerciser lo c reale different comer cases randomly lo 
stress the I/O Subsystem, and to perform the necessary 
self-checking without human intervention. 

Five state machines were added to the bus exerciser model 
for the stress tests. Three of Ihe slate machines are DMA 
state machines for Ihe LAN, SCSI, and Centronics devices. 
The slale machines contain blocks of code that can be put 
into the insi ruction queue and executed by the bus exercis- 
er. The blocks of code tell Ihe DMA devices in the I/O sub- 
system lo do a DMA read or write. In Ihe case of the SCSI, 
for example. Ihe bus exerciser's SCSI slale machine can 
send a block Of code to the instruction queue which, when 
executed) causes the SCSI device to perform a DMA read 
from memory. Once the SCSI device finishes the DMA read, 
it sends an interrupt back to the bus exerciser. Next, Ihe bus 
exerciser's SCSI state machine sends a block of code lo the 
instruction queue, which causes the SCSI device to perform 
a DMA write back lo memory. Once the SCSI device finishes 
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process is repeated in a loop and the program can run for 
any length of time. During each loop, the DMA. state machine 
gets its DMA transfer sizes and starting addresses randomly 
from a range of selections. An example of how the bus exer- 
ciser's SCSI state machine operates is illustrated in Fig. 4. 
The LAN and Centronics state machines behave in a similar 
fashion and all three state machines operate simultaneously 
but asynchronously. This results in a stressful simulation 
emironment. 

In addition to the three DMA state machines there are two 
other state machines that make the system even busier. 
These state machines also send blocks of code to the instruc- 
tion queue, but the instructions access non-DMA devices. 
The bus exerciser issues a highly mixed partem of instruc- 
tions to the I/O subsystem by using this combination of three 
DMA state machines and two non-DMA state machines. 

Although the bus exerciser was used as a debugging tool, it 
in no way replaced full system verification. It was still nec- 
essary to simulate operation with the CPU and the memory 
and system bus controller chip lo catch system-level bugs. 
However, more than 95% of the I/O controller chip logic bugs 
were caught in the bus exerciser emironment before inte- 
gration with the CPU and memory and system bus controller 
chip models. The result was a clean I/O controller chip that 
worked ihe first time and required no chip turnarounds. 
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Fig. 4. Operation of the bus exerciser's SCSI slale machine. 

the DMA write, it sends another interrupt back to the bus 
exerciser. Once the round I rip DMA has been completed, the 
bus exerciser's SCSI state machine compares the data read 
front memory wilh Ihe data written back lo memory. If the 
comparison is not successful, an error is flagged. This 
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Design Verification of the HP 9000 
Series 700 PA-RISC Workstations 



First a high-level system model was simulated and compared with a 
reference machine running both HP standard and pseudorandom test 
programs. Then the same tests were run on hardware prototypes. All 
chips were able to boot the operating system on first silicon. 

by Ali M. Ahi, Gregory D. Burroughs, Audrey B. Gore, Steve W. LaMar, Chi-Yen R. Lin, and 
Alan L. Wiemann 



The HP 9000 Series 700 workstations are based on a set of 
new custom VLSI chips. 1 - 2 To bring these products to mar- 
ket as quickly as possible, it was essential that these new 
chips function correctly the first time. System verification 
through simulation and prototype testing addressed this 
objective. This work proceeded in two phases. Phase 1 con- 
sisted of simulating a high-level behavioral model of the 
Series 700 system running a variety of test programs. Phase 
2 consisted of running similar tests on initial prototypes of 
the system. — 

In addition to simulation and verification of the system, each 
new VLSI chip was subjected to stand-alone testing. A be- 
havioral model of the chip was tested with vectors from 
vector generators without regard to the other chips in the 
system. Both test efforts were essential to achieving The goal 
of fully functional first silicon. This siand-alone testing was 
clone by the chip design groups and that work is not 
described here. 

The objectives for phase 1 of the system verification were 
twofold: (1) to demonstrate the correctness of the design 
according to HP standards, and (2) to find any functional 
bugs. Meeting the first objective meant ranning and passing 
the HP standard test programs (see "HP Standard PA-RISC 
Test Programs." page 35). Our strategy for achieving the 
second objective consisted of using whatever means were 
necessary to discover functional bugs in the behavioral 
models of the VLSI chips. 

The object ive for phase 2 of the system verification was to 
find any functional bugs thai were not detected during 
phase 1. The phase 2 testing was focused largely on the new 
VLSI, but also served to discover bugs in other components 
in the system, including vendor-supplied components. Most 
of the test software developed for phase 1 testing was in- 
tended for eventual use in phase 2, but underwent substan- 
tial modification to run on actual hardware. System debug- 
ging software required that pseudorandom tests be modified 
to cooperate with the debugger in control of the hardware. 
Device drivers developed for the simulation models had to 
be modified or rewritten completely to work with actual I/O 
hardware. 

Because of the considerable overhead of Simula! ing an indi- 
vidual chip in a system, simple fund ional bugs confined to a 



single chip (error in branch prediction, floating-point 
arithmetic error, etc.) are best found in stand-alone testing, 
where the chip is surrounded by only enough support logic 
to allow it to function normally. The system model is primar- 
ily used to discover bugs that only show up when two or 
more of the chips are working together. Nonetheless, several 
bugs were found in system simulation that were confined 
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Fig. 1. HP 9000 Series 700 system block diagram. 
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HP Standard PA-RISC Test Programs 

The PA-RISC vesication process involves running a set of HP standard test pro- 
grams on the model of the implementation, and creating and running code specific 
to the system. The standard code is run to ensure that the implementation mini- 
mally conforms to the architecture and to ensure that defects in previous imple- 
mentations are not round m the current implementation For our implementation, 
some of this standard code needed modification to Bring it up to date with the 
current version of the PA-flISC architecture 

AVPs 

During the verification of PA-RISC hardware a de facto standard set of test code is 
used to confirm that the implementation minimally conforms to the PA-RISC archi- 
tecture This set of test code is known as the architectural verification programs 
I AVPs) These AVPs are divided into two categories, processor and floating-point 
One of the first mapr milestones reached during verification was the behavioral 
model's successful completion of all of the processor and floating-point AVP tests 

IDCs 

Another group of de facto standard tests used during PA-RISC hardware verifica- 
tion is the implementation dependent codes IIDCsl This group of tests is com- 
posed of several small pieces of code written during the verification of previous 
PA-RISC implementations. This code usually stresses a characteristic (pipeline 
interlock, bypass, etc I or exhibits a defect found m a previous implementation 
These tests are run to ensure that the implementation being verified does not 
contain defects found in past implementations However, the characteristics that 
some of these tests were written to stress in previous implementations will not 
necessarily be exercised in the implementation currently being verified. 

Implementation-Specific Test Code 

For our verification task, we developed our own set of tests to stress the memory 
and I/O controller portions of the system Operations performed by these tests 
typically included simple I/O register reads and writes, small DMA transfers, or 
error condition creation. These tests typically did not create much activity on the 
system bus, but ensured that the basic functionality of individual pieces of the I/O 

sysitni was wuikmy LUiiBLiiy Once iiic uasii. fuiiuiuMdiily vvaa veii'ieu. iiic iBBfS 

complex and stressful testing with the BPS I/O tests (see accompanying article) 
could proceed 



Within an individual chip. The point is thai system simula- 
tion is not a substitute for stand-alone testing, but rather 
supplements it. 

Scope of the Problem 

The system verification inc luded behavioral models for the 
following chips and boards: Ihe processor chip, the floating 
point coprocessor chip, the memory/system bus controller 
Chip, the memory/system bus interface chip, the I/O control- 
ler chip, the processor board, and the system I/O board (see 
block diagram. Fig. 1 ). 

Our search for functional bugs concentrated on these chips 
and boards. The complete system model also included be- 
havioral models for cache RAM chips, memory arrays, I/O 
devices, and clock circuits. It was not our specific intent to 
discover bugs in these models, but many were found as a 
consequence of our testing. 

Challenges 

A challenge was presented by several enhancements included 
in the advanced PA-RISC 1.1 architecture of the new work- 
si at ion. This new revision of the archil eel tire required sub- 
stantial changes to existing lest programs and lools. Some 
ul 1 1 it ■ new features thai required verification included block 



TLB ( translation lookahead buffer) entries, new registers 10 
reduce CPI (cycles per instruction I on system interrupts, 
and new floating-point instructions. 

Another challenge was presented by the geographical dis- 
tribution of people. Some of the chip design groups were 
located in Colorado, while other design groups and t he veri- 
fication group were in California This meant that we had to 
rely extensively on HP's networking capabilities to exchange 
models, test files, simulation results, and bug reports. Exten- 
sive use of telephone and videoconference facilities was 
necessary in addition to frequent travel between locations. 
In spite of the considerable barriers presented by geographi- 
cal separation, a high degree of teamwork and cooperation 
was achieved. 

Methodology 

The strategy for accomplishing the objectives consisted of 
first running and passing the HP standard test programs on 
the system model. These include tests called architectural 
verification programs (AVPs) and tests called implementa- 
tion dependent code (IDCs). These tests address only pro- 
cessor and floating-point functionality and some of the 
memory access functionality. This was followed by addition- 
al testing using a pseudorandom test technique (BPS. see 
below), augmented by individual tests to check particular 
test cases. Testing of the I/O chips required the creation of 
many hand-written tests (assembly language programs) 
which were run and passed before continuing with more 
pseudorandom I/O testing. 

Completion Criteria 

Some measure of completeness was needed both to quantify 
our objectives and to enable us to measure progress towards 
acnievmg tne objectives. This work had iwo basic comple- 
tion criteria: ( 1 ) pass all AVPs, IDCs, and newly created I/O 
tests, and (2) run some fixed amount of new pseudorandom 
tests without a bug-caused failure in any model. The second 
criterion is analogous to the 24-hour CHO (continuous hours 
of operation ) metric' used for operating system testing. For 
phase 1, the criterion was lo ran new tests on all available 
simulation engines for at least one week. For phase 2, the 
criterion was to run new tests on a prototype for 48 hours. 
The one-week and 48-hour time limits are somewhat arbi- 
trary, as is the 24-hour limit for operating system testing. 

Reference and Target Machines 

Self-checking verification programs, such as the PA-RISC 
AVPs (see below), have built-in routines that check for cor- 
rect program execut ion. Self-checking programs have many 
strengths, but suffer from one fundamental weakness: it is 
difficult to write a self-checking test for an unforeseen fail- 
ure mode. Our system verification process used a reference 
machine verification methodology to supplement the exist- 
ing self-checking verification programs. In our reference 
machine methodology, a Series 700 simulator was developed 
from the PA-RISC simulator (see below). Each verification 
program was run against the Series 700 simulator and Ihe 
behavioral model. The test was considered run successfully 
if a particular subset of the machine stale of the model 
agreed with that of the simulator at Ihe end of the two 
program executions. 
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Simulation Toolset 

HP's ChipBuster loolset and other lools were used to create and run tests on the 
Series 700 system model. 

ChipBuster is a set of tools used m create models, run simulations, do schematic 
capture, generate simulation vectors, run simulations and do artwork layout design 

MADL is the modeling language in the ChipBuster toolset that was used for many 
of the models for the Series 700 

Guide is the environment in ChipBuster that is used to compile and then run 
simulations on the MADL models. 

Piglet is a schematic capture system that is part of the ChipBuster toolset The 
captured schematics were then converted to structural models and incorporated 
into the system model for the Series 700. 

BEST is one of the simulators that is part of the ChipBuster toolset It runs under 
the Guide environment BEST is short for BEhavioral, functional Specified Timing 
simulator 

The Design Capture System (DCS) is another schematic capture system that was 
used for capturing the board level schematics on the Series 700 DCS is not in the 
ChipBuster toolset, so a program was written to convert DCS output to structural 
MADL 

The Design Database Language IDOLI is a programming language that runs under 
DCS and provides a way to access the DCS database. A DDL script was used to 
access the DCS database and turn the schematics into structural MADL models 

PLADO is a modeling language that describes the behavior of PLAs (programmable 
logic arrays). 



Wo used two techniques to compare state. In the simple 
case, a test was run on both the Series 7(H) simulator and the 
behavioral model, the final state was dumped to a file in 
both cases, and the states of the simulator and fho model 
were compared. In the more complicated case, the program 
was first mil on the simulator, an execution trace and the 
final machine state were saved and incorporated into a self- 
checking program, and the self-checking program was run 
on the behavioral model. 

PA-RISC Simulator 

The PA-RISC simulator is a functional level instruction set 
simulator for the PA-RISC architecture.' 1 The simulator runs 
programs at a rate of thousands of instructions per second, 
and is a much more pleasant environment for developing 
and debugging verification test source code than the behav- 
ioral model. The simulator is a correct implementat ion of 
the PA-RISC architecture, so it can serve as a reference 
machine for processor-oriented programs. To support verifi- 
cation of PA-RISC 1.1 processors, we made the following 
enhancements to the PA-RIS( ! simulator: 
Upgrade to PA-RISC 1.1 

Additional floating-point registers and instructions 

( ache extensions 

Page size increase from 2K to 4K bytes 

Block TLB entries 

Shadow registers 
Support for additional processor organizations 

Virtual cache and arbitrary cache indexes 

Additional I/O modeling capabilities 
Additional hooks for scheduled interrupts and I/O. 



Simulation Tools 

We used the HP ChipBuster toolset to do system simulation 
(see "Simulation Toolset," at left ), specifically 'he BEST sim- 
ulator and the Guide environment ( BEST and Guide are 
parts of the ChipBuster toolset ). However, the various parts 
of the system were modeled in different ways, and each pari 
had to be incorporated into one system model that could run 
under Guide and BEST. The system simulation model came 
from a variety of sources, and we Used a number of tools to 
create the model and to run simulations. 

Behavioral models for most of the VLSI chips were written 
in the ChipBuster MADL modeling language. Their structural 
connectivity w as added using Piglet, a schematic editing 
tool contained within ChipBuster. The VLSI chips modeled 
in this fashion included the processor chip, the floating- 
point coprocessor chip, and the memory and system bus 
controller chip. The VLSI chip models were connected by a 
lop-level structural model, which was created from a sche- 
matic of the CPU board captured by DCS — HP's Design 
Capture System. DCS is not pan of ChipBuster. but the I X 'S 
database can be accessed using a programming language 
called DDL. which runs under DCS. We used a DDL scrip! to 
change the DCS schematic to structural MADL. The VLSI 
models were given to us by the chip designers, but models 
for other parts of the CPC board were created specifically to 
Support design verification. These parts included the 
instruction and data caches, the clock circuitry, an EISA 
model, and a graphics model. 

The I/O controller chip and the I/O system were modeled 
differently from the rest of the system. The stale diagrams 
for the I/O controller were convened to stale equations using 
the PLAIx i tool, a modeling language that describes the 
behavior of PLAs (programmable logic arrays ). The state 
equations were convened to MADL structural and behavior- 
al models. The rest oi the I/O system was modeled using a 
combination of behavioral and structural models, and more 
PLAI n ' mi idols for the various I/O devices such as the S< IS) 
chip and the IAN chip. Once the models were convened to 
MADL they were attached to the rest of the system model by 
hand-editing the top-level Structural MADL model that was 
generated from DCS for the CPU board. The editing was done 
to include the top-level I/O signals, which were attached to 
the CPU board via the system bus connector. 

We had a variety of hardware resources on which to run 
system simulations. The verification group had six IIP 900(1 
Model 8:55 systems and three Model 855s for system simula- 
tion. One of the Model 885s contained the system model and 
was not used for simulation, but was used for bringing up 
now models and as an NFS ( Network File System) server. 
The other machines were NFS-mounted to the discs 
containing the model. 

Simulations were managed by means of a queuing system 
which wo developed. 
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Pseudorandom Test Technology 

I nlike other test suites which are written entirely by hand, 
the pseudorandom lest programs used for Series 700 system 
verification were created by a very powerful test generation 
tool called the Busy-system Program Synthesizer, or BPS. A 
test is a collection of rules, which are consumed by the pro- 
gram generator to create random programs. The rules define 
the kind of instructions allowed in a test. The program gen- 
erator creates programs that contain instructions defined by 
the rules in random order. One set of rules can yield many 
different programs. BPS generates virtually countless test 
programs and allows us to exercise the system exhaustively. 

The verification process is an iterative process (see Fig. 2). 
It starts with using the program generation tool BPS to 
create a test program from a collection of rules. This test 
program is then run on the target hardware model (or proto- 
type) and on the reference machine, that is. the PA-RISC 
simulator. The reference machine verification techniques 
described under "Methodology" above are applied. When 
differences between the target and reference machines oc- 
cur, the execution traces are compared and the potential 
design flaws are identified. When there are no differences 
between the target and reference machines, the test is dis- 
carded. This process is repeated until all functional bugs 
have been found. 

Why BPS? 

BPS can reduce the time it takes to develop test software. 
Traditionally, a lest was written specifically for every system 
function or group of functions under test Test program de- 
velopment was a very demanding pan of I he verification 
process. The extent to which verification could be performed 
before the design release was limited by the amount of test 
software available. As the complexity of VLSI increases and 
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new-product development time decreases, the need for a 
better and faster way to develop test programs grows. BPS 
fills this need. 

BPS also breaks coding habits. People tend to develop habits 
after doing certain things many times. Once a habit is devel- 
oped, code segments written for similar tasks lend to follow 
the same coding style. Programming habits limit the variety 
(jf operations thai are exercised on a target system. BPS 
avoids this problem. 

Thirdly, BPS can stress a target system as well as real-world 
software. At the software level, there is a boundary between 
system code, user code, and firmware code. At the hardware 
level, that distinction disappears. The code from all three 
sources is intermingled in front of the CPU. The system 
should be able to perform any combination of instructions 
under any system conditions. Software that can model the 
mixture of operations coming from all those three code 
sources is needed, especially during the development of new 
hardware, when il is not possible to tun the real operating 
system and user software. BPS satisfies litis requirement. 

Finally, debugging may be easier for BPS than it is for other 
test software or for a real operating system. Linear code is 
easier for a human to follow than repetitive code. (Hand- 
written code dial has loops can be made easier to debug by 
unfolding the loops.,) During the turn-on phase of new hard- 
ware, the effort to debug the hardware using the real operat- 
ing system is tremendous because of the amount and the 
complexity of the operating system code. BPS generates 
nonlooping (but branchy) code that stresses the system just 
as much as the real software. 

Basic BPS Concepts 

Programs generated by BPS have four basic components: 
' A system initialization routine 

• A page of random instructions 

• ( >ne or more pages of random data 
' A group of hand-coded routines. 

The random instructions are created based on the rules given 
to BPS. The random instruction page is the only part that 
varies among different programs general ed by BPS. The 
data in the random data pages is arbitrary. Random data is 
pregenerated. The same random data is included in ever} 
tesi program. The hand-coded routines handle various trap 
conditions. System initialization sets the system to certain 
slates for each test. 

BPS tests are used for verification of three major system 
functional Bolts: the processor, the floating-point coproces- 
sor, and the I/O subsystem. The I/O tests ultimately verify 
the entire system. 

Processor and Floating-Point Coprocessor Tests 
ft typical BPS CPU/fiP program has four sections (see Fig. 3)i 
initialization code, random instructions, random data, and 
I rap handlers. 

At the beginning of a program, the initialization code sets up 
the system resources (control registers, general registers, 
floating-point registers, translation lookaside buffers or 
TLBs, etc.) to Certain stales. The initialization code runs in 
real mode. It turns on the virtual mode before execution 



Fig. 2. Verification process 
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Fig. .3 Structure or a BPS test. 

switches 1<> the random rode. The necessary setup for virtu- 
al addressing mode includes TLB entries, address queues, 
the processor status word, and so on. The swiich from real 
to virtual mode takes place on the last instruction in the 
initialization code. Once in virtual mode, execution of 
instructions on the random code page continues indefinitely 
until a recovery counter trap occurs. 

BPS generates only one random code page for every pro- 
gram. All virtual instruction pages are mapped to that same 
code page. The offset part of a virtual address is Ihc offset of 
a corresponding physical page. When a virtual address goes 
across a page boundary as a result of sequential execution, 
an 1TLB (instruction TLB) miss trap occurs. The trap handler 
maps the new virtual address to the same random code page, 
and execution falls back to the top of this page. When a vir- 
tual address goes to another unmapped virtual page as the 
result of executing a branch, again the new virtual address is 
mapped to the same random code page by the trap handler. 
A virtual address may point to anywhere in the virtual space, 
but the mapped physical address never goes out of the ran- 
dom code page. The program is stopped by the recovery 
counter Irap alter some desired number of instructions are 
executed on the random code page. 

The strategy used for random instructions is also used by 
the DTLB (dala TLB) Irap handler for random data. To verify 
the TLB protection function, random access rights are used 
when inserting a new TLB entry. Thus, a TLB miss trap is 
often followed by a TLB protect ion trap. To recover from 
the protection trap, the protection trap handlers supply- 
good TLB entries and good PIDs (process IDs). The illegal 
instruction trap handler sets the IPSW to nullify the illegal or 
undefined instructions. The assist exception Irap handler 
retries the exception instructions. The recovery counter trap 
handler stops a test and also invokes a self-checking routine 
for floating-point tests. 

Target versus Reference. A lest is passed if. when the test 
stops, Ihe contents of the general registers of the target 
machine match those of the reference machine. Although 
the final state of the general registers may seem to represent 
only the operations near the end of a test, in many cases 
when the target machine operations begin to depart from die 



reference machine operations, the differences are propagated 
all the way to the end of Ihe lest, 

In this project, the floatingpoint registers were not visible 
on the targel machine. Instead, the lesi software stored all 
lloating-poinl registers into memory and compared them with 
reference values stored in another memory area. The refer- 
ence values were created during lesl program generation 
and were included as part of the test program. 

Frequency Files. BPS generates random codes according lo 
user-specified rules. The file thai holds these rules is called a 
frequency file (see Fig. 4). The rules can be general or spe- 
cific. A rule describes what PA-RISC' instructions are to be 
generated in Ihe random code. This may be a specific in- 
struction, any instruction in a particular group, or a specific 
instruction sequence. 

Instructions described in a frequency file will appear many 
limes in random order in the code generated by BPS. The 
number of occurrences is controlled by two things: the fre- 
quency and the total number of Instructions lo be generated. 
The frequency for each user-specified rule is a relative 
weight assigned by the test developer. An instruction de- 
scribed by a rule with twice the weight of Others will appear 
twice as often in Ihe resulting code. The statement icount 
followed by a number in a frequency file describes the total 
number of instructions to be creaierl for a lesl program. 

Rules for instructions that perform similar hardware opera- 
tions are grouped together to form basic tests. For example. 
ALU is a basic test category. The frequency file describes 
only ALL" (arithmetic logic unii ) instructions. The random 
code generated from I hat frequency file contains all of the 
ALU instructions described in that tile in random order. Other 
basic tests include CPU memory load. CPU memory store, 
unconditional branch, conditional branch, floating-point 
operations, floating-point memory operations, and so on. 
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t Example frequency file 
/ 

t Generate 1 page of instructions (1024 instructions) 
0 

icount 1024 
# 

t shift-and-add instructions 
I 

50.shladd |t=0 2 9-31| 

50.Shladdl |t=0 2 9-31) 

50.shladdo |t-0 2 9-31) 

I 

C an interesting load sequence 
10.| 

ldw |d= 0 s-1 b-4 t= 4 20 21 22 23 ) 

ldw Id- -16 -12 -8 -4048 12 16 s-1 b-4 t= 20 21 22 23 ) 

ldw |d- -16 -12 -8 -4 0 4 8 12 16 s-1 b-4 t- 20 21 22 23 ) 

I 

t 

t some floating-point instructions 
t 

30.fadd,DBL ( £rl-0 4-7 fr2=0 4-7 ft-4-7) 
30,fSUb,DBL lfrl-0 4-7 fr2=0 4-7 ft-4-7) 

t 

t acne store instructions {note use of macro) 
t 

30.StWS I b-0 4 d— 16 -12 -8 -4 0 4 8 12) 

30-sths I b-0 2 d— 16 -14 -12 -10 -8 -6 -4 -2 0 2 4 6 8 10 12 14 1 

30.stbs I b-0 9 | 

30. ( 

nop 

SETGR I i-Ox2200 0x2400 0x2600 0X2800 0x2a00 0X2C00 0x2e00 t-8 I 
Stwas |t=0 10-31 d— 16 -12 -8 -4 0 4 8 12 b=8 ) 

) 

30.Stbys ( b-0 9 d— 16 - 15) 

Fig. 4. An example of a frequency file 
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Debugging Tools 



In phase I of the HP 9000 Series 700 system verification, most of the debugging 
was done using the e«ecution history files of the model to compare the mode: 
with the reference machine A depiper Woe* was implemented inside the proces- 
sor behavioral mooe! It collected the addresses and data associated with the 
various stages of an instruction's execution and wrote them at the instruction's 
last pipe stage Traces of signal values were available, taken either at the pm 
oounaary or at internal nodes of the system components A custom difference 
locator was used to compare the tlow r the test programs on an instruction-^ 
instruction oasis Isee Fig I) 

In phase 2 the remote debugger RDB was used for running the test programs on 
the prototype The HP 16500 logic analyzer was used to trace system bus transac- 
tions and the HP AWAX toolset was used for cache and Pous activities RDB. tne 
remote debugger, is a program that downloads an executable file from an HP 9000 
Series 300 computer into the memory of a prototype. Its interface and control flow 
are similar to that of the PA-RISC simulator 1 AWAX is an HP internal processor 



monitor toolset for the PCX chipset family, to which the Series 700 cnips belong. 
Functionally, AWAX consists of five pans data capture board, depiper. state 
analyzer, performance analyzer, and controller 

The bus monitor is a program, used during phase 2. that takes a trace o* the system 
Dos ano provides a statistical analysis of tne activity and utilization of the bus isee 
fig a 

imtjb. mit_cache. and Similar programs initialize the TLB and registers m the 
CPU This is done to bring the model to a particular state without having to go 
through the whole simulation sequence again 

Reference 

1 D Magenheime' 'Remote Oeougger ' HewtettPack3rOJou'iwl.W 37 no 8. August 1986. 
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Tick lAspaco lAoffset Instr. DAspace DAoffset Data Result Decoded Instruction 



4750 
4790 
53230 
53690 
53730 
53750 
54570 
54590 
•** 

54610 
54630 
54650 
54690 
54730 

54790 
55250 
5 5 750 
55810 
79230 
79250 



UUUU 

uuuu 

1ABC 
1ABC 
1ABC 
1ABC 
I ABC 
1ABC 
1ABC 
1ABC 
1A8C 
1ABC 
1A8C 
1ABC 
1ABC 
1ABC 
1ABC 
1ABC 
1ABC 
FACE 
FACE 



00000000 
00000004 
00000140 
00000150 
00000154 
00000274 
00000278 
0000027C 



23E 13000 
37FF0072 
829A4012 
E8400238 
B6770008 
22200002 
36310000 
OE20119C 



00000280 083C0612 
00000284 227FF010 
00000288 36730FFE 
0000028C 0A724880 
00000294 0E321380 



00000298 
0000029C 
000002D4 
00000314 
00000090 
00000094 



9753003E 
E8134002 
E8000072 
22211000 
0C9930A3 
0C9930A5 



0000 
0000 
DADA 
DACE 
FOFO 
FOFO 
FOFO 
FOFO 
0000 
FOFO 
1ABC 
FOFO 
DADA 
FOFO 
0000 
FOFO 
DADA 
FOFO 
FOFO 
DACE 
DACE 



00003800 
00003800 
000000 IE 
BEEFCAFE 
00600084 
00100000 
00100000 
00100000 

00000001 
008FF800 
008FF800 
FF 700002 
00100000 

00000019 
00000000 
00000000 
00002800 
BEEFCAF4 
BEEFCAFO 



nnnnnnnn 
nnnnnnnn 
nnnnnnnn 

nnnnnnnn 

nnnnnnnn 
nnnnnnnn 
nnnnnnnn 
00000000 
00100020 
nnnnnnnn 
nnnnnnnn 
nnnnnnnn 
nnnnnnnn 
00000001 
00100021 
nnnnnnnn 
nnnnnnnn 
nnnnnnnn 
nnnnnnnn 
BEEFCAFE 
BEEFCAFE 



00003800 LDIl 0x3800,31 
00003839 LDO 57(31), 31 
FFFFFFFB COMBT,<,M 26.20,829a4022 



00000158 BL 
00600084 ADD I 
00100000 LDIL 
00100000 LDO 
00100000 LDWAS 



e840035c,2 
4,19,23 
0x100000,17 
0(17), 17 
0(0, 17), 28 



00000001 ADD 28,25,18 

008FF800 LDIL 0x8ff800,19 

008FFFFF LDO 2047(19), 19 
00000000 COMCLR,< 18,19,0 

00100000 STUAS 18,0(0,17) 

00000006 SUBI 31,26,19 

000002A4 BLR.N 19,0 

000002DC B,N cS0000b2 

00002800 LDIL 0x2800,17 

BEEFCAF4 LDUS.MB -4(0, 4), 3 

BEEFCAFO LDUS.MB -4(0, 4), 5 



Fig. 1. Difference locator output 
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846 11 11 


1 1 0 1011 


111111 


11111 


F0822804 FFFFFFFF ; 


HOST 


idle 


848 




847 11 11 


1 1 0 1011 


11110 1 


11111 


F0822804 FFFFFFFF ; 


HOST 


read 


F0B22804 begin 




848 11 11 


1 1 0 0111 


111110 


11111 


18181818 FFFFFFFF ; 


HOST 


read 


F0822804 , Dsync 




858 11 11 


1 1 0 0111 


111111 


11111 


18181818 E86016C0 ; 


HOST 


read 


F0822804 wait 10 




859 11 11 


0 1 0 0111 


111111 


11111 


18181818 E86016C0 ; 


HOST 


read 


F0822804 wait 11, 




866 11 11 


1 1 0 0111 


111111 


11111 


E86016CO FFFFFFFF ; 


HOST 


idle 


7 




867 11 11 


1 1 0 0000 


11110 1 


11111 


F0000000 FFFFFFFF ; 


HOST 


read 


FOOOOOOO begin 




868 11 11 


1 1 0 0111 


111110 


11111 


FOOOOOOO FFFFFFFF J 


HOST 


read 


FOOOOOOO , Dsync 




915 11 11 


1 1 0 0111 


111111 


11111 


0004C924 E80116CO J 


HOST 


read 


FOOOOOOO wait 47 




916 11 11 


0 1 0 0111 


111111 


11111 


0004C924 E80116CO J 


HOST 


read 


FOOOOOOO wait 48, 




1023 11 11 


1 1 0 0111 


111111 


11111 


E80116C0 FFFFFFFF ; 


HOST 


idle 


107 


master: 


total 


idle 


read 


write 


b read b write 


intrpts 


bus_crrs bus_master 




cycles 


cycles 


ops/cycles 


ops/cycles 


ops/cycles ops/cycles 






SGCO : 


0 


0 


0 


0 


0 0 




0 


0 0 


SGC1 : 


0 


0 


0 


0 


0 0 




0 


0 0 


EISA : 


6528106 


4638313 


20371/ 


874829/ 


0/ 0/ 


455 


0 118648 








133055 


1756738 


0 0 








CORE : 


3151 


1468 


412/ 


97/ 


0/ 0/ 




0 


0 117 








1392 


291 


0 0 








HOSt : 


6575943 


5508336 


260394/ 


10982/ 


0/ 0/ 


2066 


0 118997 








907823 


159784 


0 0 








totals: 


13107200 


10148117 


281177/ 


885908/ 


0/ 0/ 


2521 


0 237762 








1042270 


1916813 


0 0 









Ready, data = E86016C0 



Fig, 2. Bus monitor output 
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In most cases, creating a now lost category is a matter of 
writing new frequency files or combining existing ones. For 
example, ('PI " memory load and store can he combined, as 
can CPU and floating-point memory load and Store. 

Frequency file development is a peculiar form of program- 
ming. These files must be designed with care. A rather strict 
register convention was adopted to allow for address align- 
ment, procedure calls, floating-point data subsets, and other 
requirements; 

In summary, although the frequency files remain the same 
for a given test category, (he number of test programs thai 
can he generated is very' large, and it is easy to make new 
test categories. As a result, the number of lest programs 
available can grow both horizontally and vertically. 

Instrgen. Fig. 5 shows the BPS tesl generation process. The 
core pari of the BPS test generator is the program instrgen, 
which takes frequency files from a test directory and gener- 
ates the random instructions. The BPS test generator com- 
bines the random code generated by instrgen with the random 
data and hand-coded routines supplied to it. The tesl code is 
in assembly language. Il is compiled and run on the largel 
machine. 

Instrgen is independent of the PA-RISC implementation. The 
only thing instrgen knows aboul PA-RISC is the mnemonic 
forma) of the PA-RISC instruction set H doesn't know regis- 
ter dependences. (PI ' pipelines, or I he effects of instruction 
execution. Compared with other random code generator 
programs, the code general ion algorithm is simpler. 

Instrgen allows Ihe lest designer to describe a PA-RISC 
instruction using eilher Ihe instruction mnemonic or the 
instruction type. Instruction types are defined by type 
strings. Every PA-RISC instruction can be described by a 
major type followed by zero or more subtypes. For example, 
Ihe major type for Idwas is memory operation. The subtypes 
are load operation, word access, short displacement, and 
absolute. If only memory operation is Specified, the instruc- 
tions described include all memory load and store instruc- 
tions. If the type memory operation is followed by Ihe sub- 
types load operation and word access, the instructions 
described include only Idws, Idwas. Idwx. Idwax, lldws. and fldwx. 

One can also tell instrgen what values are valid for fields of an 
instruction or instruction type. Fields are named using Ihe 
same field mnemonics that are used in the processor hand- 
book. A field can receive a single value, a list of values, or a 
range of values. Instrgen randomly picks a value from the list 
or range for every instruction. Random values are assigned 
to fields thai have no specified value, list, or range. 
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Fig. 5. BPS "est generation process 



Floating-Point. Floating point tests are classified similarly to 
CPU tests, that is, as memory operations, floating-point op- 
erations (flops), and combinations of memory operations 

and flops. The frequency files for ihe basic floating-point 
tests can be combined with CPU tests lo form new tests. 

BPS floating-point lesls were designed for testing Ihe proto- 
col between Ihe CPU and the floating-point coprocessor, not 
for checking ihe correctness of ihe floating-point arithmetic 
operations. Data and register use were carefully chosen to 
avoid NaNs (not a number). Except ions occurred mostly 
during type conversion. The trap handler retries all excepted 
instructions. All floating-point instructions are included in 
this test suite. 

Input/Output Subsystem Tests 

The BPS I/O subsystem tests used in this project focused on 
the functional verification of the IIP VLSI, that is, the I/O 
controller and memory/system bus controller chips, not the 
off-the-shelf I/O controller chips or the peripheral devices. 
The purpose was to simulate heavy system bus transactions 
along with CP! '-to-mcmory -controller transactions. Using 
various I/O transactions, a lest exercises the major functions 
of Ihe I/O subsystem, that is, system bus arbitration, data 
transfer, and interrupt delivery. Typical I/O transactions are 
DMA transactions for Ihe SCSI, LAN, EISA, and Centronics 
interfaces, character transactions for I lie RS-232 and IIP-IIII, 
interfaces, and special graphics transactions for graphics. 

Like a BPS ( PI VFP test, a BPS I/O test has four basic pans: 
initialization code, random code, random data, and trap han- 
dlers. In addition, a BPS I/O test has device drivers, I/O inter- 
rupt service routines, and an I/O request server. The random 
code section of an I/< ) tesl contains a number of procedure 
calls to the I/' ) request server. Every call represents one I/O 
request The arguments passed to the I/O request server are: 
i Device type 
I/O transaction length 
Device options, 

The I/O request server checks Ihe device status and stalls 
the I/O transaction. 

Round Trips, hps i/< > tests check the correctness of every l/< > 
transact ion. Every I/O transaction from memory to a device 
is always followed by an I/O transaction in the opposite 
direction. Data transferred lo devices is kept in a dedicaled 
memory area. Another memory area is reserved for data 
transferred back from I/O devices. At the end of an I/O 
round trip, the data in these memory areas is compared. 
Unlike processor tests, the recovery counter does not deter- 
mine the length of I/O tests. Instead, a test stops when an 
error is encountered or after a specified number of I/O 
round trips. 

Queues and Interrupt Services. A tnaji >r ui >al in testing the l/< > 
subsystem is lo keep il busy. The software issues multiple 
I/O transactions, one for every device. The I/O subsystem is 
programmed lo signal the completion of an I/O transaction 
through an external CPU interrupt. When an I/O transaction 
is completed, a new transaction is started immediately. A 
request queue is maintained for every device for continuously 
generating l/< > transactions. 
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Metrics 



Throughout the course of this project a number of state variables Imetncs as a 
function of timel were collected By logging every simulation event I start, stop, 
pass/fail status) to a file, we were able to determine a number of progress-related 
variables automatically State variables were measured weekly and reported 
monthly The primary function of the weekly metrics was to provide some accuraie 
gauge of protect progress. A number of variables were measured automatically 
from simulation logs, primarily indicating progress m passing test suites on mod 
els Others indicated the s<;e and speed of simulations and tne effectiveness of 
the various test suites A record was kept of now many bugs were discovered by 
each suite of tests The results are shown in Fig. 1 

In general, test suites that were run first had the best opportunity to find the most 
bugs For example, the test suite labeled "Hand", that is, the hand-coded tests, 
was run before attempting the more complex BPS tests Consequently, that suite 
of tests had an opportunity to find the greater number of bugs, while the 
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pseudorandom BPS tests found the more subtle bugs We found that the 
pseudorandom test technique generally found more bugs with less effort 

Another metric was the average simulation speed This was used on a regular 
basis to estimate the number of simulation servers needed to complete a given 
simulation objective as well as the expected length of time needed to simulate a 
test Actual simulation speed as measured on a weekly basis as shown in Fig 2 

Simulation speed declined as more and more components were added to the 
model The actual simulation capacity varied from week to week as machines 
were added to and removed Irom the pool of available servers. 

These melrics were extremely useful for managing the aggressive schedule of 
the Series 700 system verification and provided a basis for planning future 
developments 



The life i >f an l/< 3 transaction begins in the random rode sec - 
tion. A randomly generated call to Hie l/i > request service is 
passed in an I/O transaction request. The transaction is 
stalled if the device is idle. < Mherwise. it is queued. When 
the device finishes the current request, the request at the 
trip of the queue is executed. A request represents a round- 
trip transaction and therefore I wo interrupts, one for the 
transaction from memory to device, and one for the transac- 
tion from device to memory. After the data transferred hack 
from the device is checked, the current request is complete 
and the request on I he lop of the queue is started. 

The l/( ) request server maintains the request queues. The I/O 
drivers sel up the devices and start the transaction. The ex- 
ternal interrupt handler dispatches lite interrupts to Ihe in- 
lerrupl service routines. The Interrupt service routines check 
the source and target data and keep the devices running. 

With interrupt signaling and request queuing, BPS I/O tests 
can keep multiple IA I transactions active at the same time. 
The arbitration logic in the I/O controller and memory/sys- 
tem bus controller are exercised for timesharing Ihe system 
bus for various !/< > transactions. While l/< > transactions are 
progressing, the l !PU is available for running the random 
code. As a resull, more requests are queued, How busy the 
system bus can be depends on how fast Ihe l/< ) queues grow 
and how fasl Ihe interntpl can be served. 



Gremlins. There is no reason why I/O lesis cannol be com- 
bined with t'l'l VFF tests. We did exactly that, keeping both 
l/( ) and Ihe CPU busy all Ihe time. In addition, ■ number of 
gremlins were added in the tests — for example: 
i Double-bit error 

1 Memory-conlroller-sourced low-priority machine check 
i Memory refresh rale change 

i Bus arbitration change 

Random read and write Of I/O registers 
Random Hush of ihe I cache 
lligh-priorily machine check during DMA. 

Phase 1 versus Phase 2. Because of Ihe complexity of the l/l t 
subsystem components I IAN, S( SI, EISA, graphics, etc. I 
and the lack of exact models compatible with our simulation 
environment, a simplified behavioral model was written for 
each component. These models followed Ihe interface pro- 
tocol, but did not contain the full functionality or represent 
ihe liming characteristics of every component 

The timing behav ior of these I/O component modules, unlike 
Ihe aetua] chip, was close CO that of the CPU. Thus, by the 
I line ( IT would go through scheduling a transaction (e.g., 
DMA), initiating it, and returning from the scheduling code, 
the transaction would almost be completed and the comple- 
tion interrupt would soon be receiv ed 
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For this reason, the BPS I/O phase 1 tests were run in bursi 
mode. Burst mode means lhal a number of I/O activities are 
scheduled in a queue and are all started at the same lime. 
This guarantees a greater degree of randomness and 
concurrency in the I/O activities. 

BPS I/O phase 2 was different, since we were able to sched- 
ule I/O transactions one at a time and perform other sched- 
uling or interrupt services before ihe transaction was com- 
pleted. This was because of the overhead and latency of the 
actual system components. 

Another major difference between the BPS I/O phase 1 and 
phase 2 tests was the number of I/O transactions requested. 
Phase 1 programs ran al about one cycle every two seconds 
in the simulation environment, while phase 2 programs, be- 
ing run on Ihe real hardware, ran eighty million times faster. 
Thus there were many thousands of times more I/O tests in 
phase 2 than in phase 1. 

Measurements. During BPS testing we took some measure- 
ments of the number of reads and writes on the system bus 
connection. This gave us a measure of how busy the I/O sub- 
system was during testing. On the average, for each sample 
of 1024 cycles, there were 20.58 reads and 8.74 writes, 
adding up to 29.32 reads and writes. 

For comparison, similar measurements were taken on Ihe 
system running (he HP-UX operating system. On the aver- 
age, there was less than one read or write per sample. Two 
processes were then added: nfsperf between an HP 9000 
Series 700 and a Series 300 and the XUperf benchmark on the 
Series 700. The average numbers of reads for nfsperf and 
XUperf were 14 and 13 respectively, and ihe average numbers 
of writes were 9 and 12, for totals of 23 and 25 reads and 
writes per 1024 samples. Thus the BPS testing did 17% to 
28% more reads and writes I ban the HP-UX operating system 
running nfsperf and XUperf. 

Results 

As a result of Ihe combined efforts of the system verification 
team and the chip development teams, first silicon on all 
parts had no critical bugs. The system was able lo boot the 
HP-UX operating system on first silicon with no work- 
arounds. This was a new record for IIP. 

There were also a number of useful by-products from this 
system simulation work. The architectural simulator was 
upgraded to PA-RISC' 1.1 and extensions were added to 
model external interruptions and additional I/O devices. The 
pseudorandom program generator BPS was upgraded to 
PA-RISC 1.1 and extended to handle asynchronous I/O. The 
AVPs and IDCs were also upgraded to PA-RISC 1.1 where 



needed. A significant number of testing utilities were devel- 
oped which can be used by other similar efforts. One other 
tool, a pipeline performance simulator called pipes, was de- 
veloped somewhat independently of the simulation effort 
and was furnished to the optimizer developers for liming 
their design to the Series 700. 

PA 7100 Chip 

The design teams responsible for the microprocessor chips 
in the Series 700 were also responsible for the recently an- 
nounced PA 7100 processor chip. 1 This chip is significantly 
more complex in that it is a second-degree superscalar ma- 
chine and incorporates on-chip floating-point circuitry- B 
also contains numerous other features designed lo increase 
performance. 

The same techniques were used for design verification of 
I his new chip, except I hat BPS was used to test the design 
before the standard test programs were used. The reference 
machine was also enhanced so thai it would automatically 
identify the location in the execution of Ihe pseudorandom 
tests where an error was likely to have occurred in Ihe design 
under test. 

The results obtained with the PA 7100 were similar lo those 
obtained with the original chips. The HP-UX and MPE/iX 
operating systems were booted on first silicon. 
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Mechanical Design of the HP 9000 
Models 720 and 730 Workstations 



The CPU board. I/O board, graphics board, power supply, mass storage 
tray, and EISA board assembly are designed as easily accessible modules 
to support the design goals of low cost, accessibility, serviceability, and 
manufacturability. The appearance is new, attractive, and compatible with 
existing HP computer products. 

bj Al ien L. Roesner and John P. Hoppal 



The dramatic increase in performance achieved hy rhe 
HP 90O0 Series 70(1 workstation computers did not come at 
the expense of the high emphasis on quality and usability 
expected of HP products. Good looks, accessibility, upgrad- 
abiliry. and serviceability were major object ives for the 
mechanical design. 

Package Design of the Models 720 and 730 

From the beginning, the Models 720 and 730 were emisioned 
as computers that house many different modules, all of w hich 
can be accessed and replaced by the customer for a variety 
of reasons. The modules play a major role in meeting the 
design goals of low cost, accessibility, serviceability, manu- 
facturability, and cosmetic appearance. Modules chosen for 
easy access were the CPU hoard, I/O board, graphics board, 
power supply, mass storage tray, and EISA board assembly. 
Designed in Series 300 fashion, all modules slide into the 
rear of the machine on specific card guides. Dialing with a 

backplane at the from of the machine. With the exception of 

the fans, front panel, and display circuitry, all functional 




Fig. i Disassembled HP9000 Model 720/730 SPU package 




lb) 



Fig. 2. Access to the disk tray. (Tl ' board. I/O board, graphics 
hoard. EISA board assembly, and power supjib Is from the rear. 

components are found within these easily removable mod- 
ules. Fig. 1 shows the disassembled Model 720/730 package. 

Access to modules is simple (Fig. 2). Each moduli' is re- 
tained in the system package by screws fastened through 
boles in an associated bulkhead panel. To access any mod- 
ule, a maximum of five screws must first be removed froin 
the rear, and then the module can be extracted by pulling On 
a handle. Access is equally easy for machines configured in 
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either of the supported oriental ions: desktop or deskside. A 
standard Hal-blade screwdriver is all thai is needed to 
remove cables and screws. 

Module Features 

The Model 720/7:10 power supply is small given Hie 200 
watts of power il delivers. Al 1.0 watts per cubic inch, it was 
a challenge lo 111 the supply into It8 enclosure. Kor safely 
reasons, Ihe supply is fully self-contained in its own Sheet- 
metal housing. The edges of Ihe printed circuit board pro- 
trude from Ihe sides of Ihe housing and are used to slide Ihe 
Supply inlo its card guides in Ihe system package. This con- 
cept was chosen in lieu of folded sheet-metal flanges on Ihe 
power supply housing because ii pro\ ides good registration 
of the power supply connector lo Ihe backplane connector 
and eliminates Ihe possibility of a nonfn situation caused by 
tolerance accumulation. Even with ibis feature, a set of 
blind-mating connectors is essential for the Implementation, 
To replace the supply, a bulkhead covering the supply must 
be removed from the rear of the computer. A handle attached 
to the supply is I hen pulled lo release the supply from the 
backplane and remove it from the box. 

The CPU, I/O. and graphics boards all are similar in size and 
lit inlo the middle compartment of ihe system package. To 
achieve as much airflow as possible, high-densily Dialing 
connectors (AMP Z-pack) were chosen for ihe backplane. 
These connectors allow for some misregistration and work 
well as blind-mating connectors, which was a necessity With 
the anticipated tolerance accumulations found in a card- 
guide application. .Since any one of the three boards can 
physically slide into any one of the three slols available, the 
system package is keyed by adding flanges to the side walls 
of Ihe cardcage. and notches are cut in Ihe front end of each 
board to correspond with ils associated flange. This pre- 
vents board insertion in Ihe wrong localion. which might 
resuli in the board's being desl royed upon power-up. 

The EISA board assembly consists of an adapier board 
(motherboard), the EISA board, and an attached bulkhead 
plate (F'ig. 3), The EISA board is attached lo the adapier 
board in an L-shaped fashion, with Ihe adapier board posi- 
tioned vertically and Ihe EISA board horizontally. The as- 
sembly is inserted into three card guides, one of which is for 




Fig. 3. EISA board assembly. 




Fig. 4. Disk; tray with external SCSI cable 

the exposed long edge of ihe EISA board. This concept is 
new to ISA/EISA accessibilily. The combinalion of Ihe EISA 
board and the motherboard is an easily removable assembly 
that does not require removal of the system package top 
cover as in lypical personal computer products. Since lop 
cover removal is nol required, a customer can easily recon- 
figure the EISA card without dismantling the monitor and 
associated cables from Ihe system. 

The disk tray (Fig. 4) is perhaps Ihe most interesting con- 
cept in accessibilily implemented in the Models 720 and 730. 
This tray permits the installation of up lo two 3!4inch mech- 
anisms (currently two fixed disk drives or one fixed disk 
drive and one flexible disk drivel. The Iray includes a Boat- 
ing blind-mating connector, which cables power to each 
mechanism. This mates with a separate backplane connec- 
tor inside the system package. SCSI signal cabling is pro- 
vided by a r>0-posilion ribbon cable, which is terminated in 
Iwo places on Ihe nay bulkhead. This establishes an internal 
backplane conneclion for power and an external cable Con- 
nection for Ihe SCSI. Willi the simple removal of the exter- 
nal SCSI cables and four bulkhead screws, the tray pulls 
completely from the system package without any other dis- 
assembly. This becomes a valuable tool for software loading, 
disk troubleshooting, and mass storage security. For soft- 
ware loading, another computer can be altached lo Ihe sig- 
nal poris of the system package by quick manipulation of 
some external cables. Software can then be transferred from 
a master disk to the local computer, after which the system 
package can be reconfigured with its own external cables 
anil act independently. Mass storage can be easily secured 
by removing a disk Iray and locking il inside a desk drawer. 

Design Considerations 

Along Willi Ihe design goals of a fast -I rack schedule, high 
quality, low cost, small package size, and good manufactur- 
ability, the Models 720 and 7:10 were required lo comply with 
regulatory (EMI/ESD). environmental (shock/vibration), and 
safely standards. As a product design is made more customer- 
accessible and therefore more complex, il is normally true 
thai all of these requirements become more difficult to meet. 
For the midrange Series 700, cost was nol as much of an 
issue as il would be for a low-end machine, bill cosl-culiing 

was still achieved by hard-tooling all of ihe many sheet-metal 
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components. Quality was increased by postplating the sheet- 
metal pans and encasing the package with plastic. Each 
module was aggressively designed to fit into as small a 
space as possible, and efforts were made to make even the 
fans and display hoards easy to install on the manufacturing 
floor. 

Perhaps the most difficult problem facing the Model 720.730 
package design with its many removable rear bulkhead pan- 
els, was in the area of regulatory compliance, particularly 
ESI) With clock speeds in the 50-io-bb-Mlrz region. CPU 
circuitry becomes very sensitive to electrostatic discharge. 
Points of entry for static discharge are any slots made by 
mating panels of the system package. With many bulkheads 
and thus many mat ing areas, the Models 720 and 730 pre- 
sented a big challenge. Consequent ly, most I/O connectors 
are outfitted with their own ESD gaskeis. and spring clips or 
dimples are added to the bulkhead surfaces to seal all holes 
or gaps. Around connectors, gaps are no more than 0.60 inch 
u idi- For overlapping metal surfaces, dimples are placed to 
provide no more than 0.50 inch of slot width. Bulkhead 
spring clip spacing is less than 0.75 inch. With the more 
stringent regulatory requirements expected in 1992. and 
increasingly higher clock speeds, it is expected thai these 
design parameters will become smaller in the future. 

EMI becomes a problem when currents are allowed to travel 
inside the machine and emanate as electromagnetic energy. 
A feature of (he Model 720/730 implementation I hat was 
aimed at controlling EMI is the design of (he disk tray com- 
partmenl. Because (he Model 720/730 package also supports 
a flexible disk drive, a large frontal hole was required in (he 
sheet metal This large hole presented a polenlial area for 
EMI radial ion. To reduce I his possibility. I he disk I ray com- 
partment is completely isolated from the other compartments 

by placing a sheel -metal wall between compartments, leav- 
ing only a small hole for the interfacing power cable. Disk 
signal lines are routed externally on the shielded SCSI cable. 
These two measures effectively control EMI emissions from 
the disk compartment 

In the area of environmental compliance, Ihe largest area of 
concern was the placement of disk drives in a nay Ihal 
slides into somewhat loose-filling card guides. Disk drives 
are historically very sensitive to shock and vibralion envi- 
ronments, and loose fits tend to amplify these levels, lb al- 
leviate ihis amplification, two measures are Implemented in 
the Models 720 and 730. The first is a panel-mounted bulk- 
head on Ihe tray. This provides some hard-mouiil features by 
Stabilizing (he rear portion of the tray. The second feature is 
the use of shock absorbing isolation gromniels between the 
drive and its associated mounting bracket These gromniels 
damp oul most vibrations at frequencies thai correspond 
with the dlive resonant frequency. These gromniels also 
allow greater displacement of ihe drives in shock testing* so 
adequate space for movement is designed into the drive 
brackets and disk tray. 

Safety compliance presented no serious challenges to Ihe 
Model 720/730 design, except thai all of the removable com 
ponenls of Ihe system are designed without sharp corners or 
edges lhat pose a danger to an operator. Pans are outfitted 
w ilh finger guards where cuslomers have access to (hem. 



The power supply's sheet-metal enclosure prevents any 
chance of electrical shock to the user. 

Industrial Design 

The industrial design for the Series 700 products took place 
in parallel With the industrial design efforts for the Series 400 
workstations. Two industrial design finite — ZibaDesign and 
Rochc-IIarkins — were hired to help with the industrial design. 
The HP corporate industrial design group was also heavily 
Involved. Most of the concepts and ideas came from a joint 
effort of the Series 400 R&D design teams in Chelmsford. 
Massachusetts and Fort Collins, Colorado and the Series 700 
BAD design team in Fori Collins. 

The overall industrial design objectives for (he Series 700 
workstations were as follows: 

• To create a new look for a new family of workstation 
products thai would visually convey innovation 

• To create a new look that would be compatible with both 
new and exisiing IIP computer products 

• To explore alternative volume configurations and shapes (o 
maximize performance and value 

• To create a design framework to guide the development of 
the workstations 

• To fit extremely well with an existing graphics product thai 
was expected to be included Wltil a large portion of the 
shipments. 

Model 720/730 Industrial Design Process 

The overall task was divided into five phases. During each 
phase, reviews were held with die IIP corporate industrial 
design group and with Ihe oilier groups developing new 
workstalions. This was done to ensure dial Ihe individual 

implementations worked together and with the existing 
computer products. 

Phase 1: Two-Dimensional Concept Development. This phase 
stalled by detennining Ihe product requirements from a 
component Standpoint. All of the Ihe internal components 
and their rough sizes were decided upon. Interconnect, ther- 
mal, EMI. and cuslomer access requirements were also eval- 
uated. I sing these requirements, hand-drawn sketches were 
done showing a number of possible layout and shape con- 
cepts. At Ihe beginning of this phase the following product 
reqiiiremenls were determined: 

• Customer access to the major individual component s. This 
was very important. A design dial was easily serviced, 
manufactured, and upgraded was also desired. 

• Support for both desktop and deskside eonfiguralions us- 
ing Ihe same basic package. This implied both vertical 
and horizontal orientations of (he product in the entl-use 

configuration. 

• Support for two 3f -inch disk drives. ( >ne of Ihe drives could 
be a flexible drive and thus would require frequent access 
by the end user from the outside of Ihe package. 

• Support of a single 1 i panel graphics board. 

• Support of a single 'A-panel processor board. 

• Support of a single ' vpancl I/" board. 

• Support of eight small iiienion boards. 

• Support ol a 2(H) wall power supply. 

• Support of a single EISA slot along with an interface board. 
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Fig. 5. IIP 0000 Model 720/7:50 Sir package in Ih.- horizontal 
orientation. 



Forced convection cooling. II was decided that llie airllow 
direction would be from to hark with I lie cool air filtering 
Ihe fronl of the package and the hot air exiling the hack of 
the package. The airflow direction Was chosen to keep the 
heated air from exiling directly out to the end user. This was 
considered especially important in the desktop configuration. 

Phase 2: Two- and Three-Dimensional Concept Development. 

During the second phase, foam core inockups of the v arious 
components were created and used to explore the alterna- 
tives from Phase 1 in more depth. Having the foam core 
components also gave heller visual feedback on overall 
shapes and sizes. 

Phase 3: Concept Refinement. As we exiled phase 2. we had 
decided upon two or three of the best alternatives. The de- 
liverables from Ihe I bird phase were urethane foam models 
and their corresponding layouts. Some preliminary detailing 
was also looked at during this phase. 

Phase 4: Concept Refinement, hi this phase the final direction 
was chosen. All of the first-pass detailing for the chosen 
alternative was completed. The deliverable from this phase 
was a single urethane foam model showing all the proposed 
detailing. The corresponding layout was also included. 

Phase 5: Concept Finalization. The final phase consisted of 
reviewing Ihe model and making minor detailing changes 
based on inputs from the groups involved. Final cosmetic 
control documents, color requirements, and one full-size 
hard cosmetic model were outcomes of the final phase. 

Family Concept 

< >ne i if i he major requirements for the new look was to 

create a family appearance for all the new workstation prod 
nets while still leaving enough freedom for each product to 
deal with its own set of specific requirements. This was Ihe 
most difficult requirement to meet. By means of frequent 
meetings and discussions among all the groups involved, a 
set of guidelines was determined that would be used to ac- 
complish this goal. The major unifying pieces decided upon 
were: 

The use of parchment white as the primary color. This also 

helped to tie in existing UP products. 

The use of intersecting rectangular shapes. 

The use of color change I dove gray) to specify areas for 

customer interaction. 



• The use of the radii specified for the existing IIP Computer 
pn iducls, 

• The use of a small recess to separate the front panel visually 
from (he rest of the package. This feature is common to all 
products that follow the industrial design standard of the 
existing III' computer products, 

• The use of standard HP colors for Ihe LEDs, keycaps, and 
switch buttons. 

• The use of the existing products' grill pattern to tie in these 
products. 

• The use of rectangular nameplates that can be rotated so that 
they work in both Ihe horizontal and vertical orientations. 

• Thi' use of specific LEl)s to show the end user overall activ- 
ity. The five specified are power on. heartbeat, incoming 
LAN activity, outgoing LAN activity, and disk activity. The 
icons used for describing the LEDs are also specified. 

• Fronl access to the power switch. 

Model 720/730 Package 

The Model 720/7:10 package is a sheet-metal enclosure to 
which cosmetic plastic covers are added. The cosmetic 
pieces form overhangs on the top front and lop right of Ihe 
package. These overhangs give the impression of two distinct 
intersecting rectangular boxes. 

The bottom, top. and sides of the enclosure are covered by 
two large parchment-white pails. These parts also form the 
overhang on the top right edge of the product. When the unit 
is used in the horizontal configuration, four small parchment- 
while rubber feet are added to recesses in the bottom cover. 
When the product is positioned vertically, the left side of the 
package then becomes Ihe bottom. There are small hole 
plugs in the cover that are removed in this case, allowing 
access to two press-in mils. Screws are then used to attach 
the pedestal fool to Ihe main unit The pedestal helps sup- 
port Ihe product and keep it from tipping over. The pedestal 
is an [-shaped foam molded pan which visually appears as 
four separate rectangular feel when in Ihe installed posilion. 













i 





Fig. 6. Ill' 11000 Model 720/7:10 SPU package in tin- vertb at 
orientation. 
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Fig. 7. ' lontrol panel area (■> I With the door 1 losedj Ctic Bvo pri- 
mary \J£Db can be Been from the outside, do With the door offers 
the remaining USDs ami switches are accessible. 

The cosmetic fiponl Of the enc losure is formed by cither 
three or four separate pans depending on whether the op- 
tional flexible disk is present. The from lace of the package 
was die major Industrial design challenge since il had to 
work well with the horizontal grill pattern used on current 
III' products, whether the product was used horizontally or 
vertically. This could have been accomplished easily by the 
use of t wo distinct front panels, bul it was fell thai il would 
be much more desirable and easily managed if one universal 
front panel could be used. It was determined dial the major 
ily of die users would use the product in the vertical orienta- 
tion. Therefore, il was decided thai the grill pattern should 
run horizontally when the product was used in the vertical 
orientation. The issue then was to make the producl also 
work with vertical grills when in die horizontal orieniation. 
'fins was accomplished by adding detailing perpendlCUUUf to 



the grillwork. Four major details give the feel of horizontal 
lines even though the grillwork is vertical. The first detail is 
a .Vmm-wide recess along the entire front of the product and 
around the comers of the front panel. The second detail is 
the overhang. This is a very" strong horizontal feature that 
tends to lessen the visual effect of the vertical grillwork. The 
third detail is a horizontal line funned at the intersection uf 
the control panel l>eze! and the Control panel door. The final 
detail is the horizontal slot for the flexible drive. The unit in 
a horizontal configuration is shown in Fig. In Fig. 6. the 
unit is shown mounted vertically w ith the pedestal foot. 

The control panel area of the from panel is loc ated at the 
right edge of the front panel in the horizontal orientation or 
the lop of the fronl panel in the vertical orientation. It is 
fonned by a combination bezel and door. When ihe door is 
closed, ihe five primary LEDs can be seen from the outside. 
By opening the door, the user has access to the remaining 
troubleshooting USDs and the service/noniial and soft reset 
switches. The keycaps for the two switches are molded, 
using custom HP colors. The r>-niui-wide recess is molded 
into ihe bezel, making that feature How completely across 
the fronl face. A closeup of the control panel area can be 
seen in Fig. 7. In Fig. 7b. the door is open. 

If the unit has a flexible disk drive, then a different bezel/ 
door combination is used. This assembly has a rectangular 
hole into which the disk drive bezel slides. The bezel was 
retooled to add the 5-mni recess feature (see Fig. S). All of 
the control-panel bezels, the door, and the flexible drive 
bezel, door, and bullon are molded in the HP specified dove 
graj color. 

The graphics package was also an integral part of Ihe de- 
sign. Its package had already been completed, and initially, it 
was unclear how to make il work with the Models 121) and 
730. Il was determined that two different bezels should be 




Pig. 8. Hi '/.el and doordesign for the flexible, disk drive 
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done for I he graphics package. One bozo! has vertical vents 
and is used with the Models 720 and 780 in ihe vertical 
oriental ion. The second bezel has horizontal vents and is 
used in the horizontal configuration: In addition, the graph- 
ics lab designed a mounting scheme that allows the graphic 
package lo be hanl-mounled to the SP1 " easily. A Model 720 
workstation with the optional graphics package is shown in 
Fig. 9. 
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Meeting Manufacturing Challenges for 
PA-RISC Workstations 



To meet the time-to-market goals for the HP 9000 Series 700 workstations, 
major contributions were made in design for manufacturability and in 
expediting standard processes. One manufacturing operation installed a 
new surface mount production facility and developed a new printed circuit 
production process simultaneously. 

by Spencer M. Ure. Kevin W. Allen. Anna If. Hargis, Samuel K. Hammel. and Paul Boeber 



In the IIP HOOD Series 700 workstation development projeel, 
manufacturing was challenged to participate with the design 
and marketing learns to meel the time-lo-market goals for 
the Series 70(1 while significantly improving the manufaelur- 
ability of PA-RISC workstatioas. The Series 700 workstations 
were I he first of a new generation of PA-RISC workstations 
for which high-volume delivery requirements were imposed 
on the worldwide manufacturing operations within the HP 
Computer Manufacturing Division. 

Manufacturing management was represented on the 
Series 700 program team by a manufacturing development 
engineering manager to provide direction to the worldwide 
manufacturing team. Overall tactical program management 
was led by a manufacturing new products engineering 
manager. Projec t managers were identified at each of the 

' Pans ol this article are reprinted with permission from Circuits Assembly maga/ine 
September 1991 1 



computer manufacturing operations around the world. Each 
manufacturing operation created a manufacturing tactical 
team responsible for local issues (see Fig. 1). 

Strategic Contributions 

Manufacturing was challenged to make major contributions 
in two key areas: design for manufacturability and expediting 
standard processes. 

Design for Manufacturability. In this area the challenge was to 
deliver a new printed circuit board assembly process that 
would allow the hardware design team to lake full advan- 
tage of the performance capability Of* the VLSI designs for 
the Series 700. The goal was divided into two parts. One was 
to release a printed circuit assembly process that provided 
surface mount capability along with through-hole parts (pin 
grid arrays and connectors) on both sides ofa 12-layer 
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printed circuit board. In parallel With I his development, one 
manufacturing operation Was required to install and release 
a new surface mount production facility, anil at the same 
lime support the Series 700 project and meet the Series TOO 
lime-to-market goals. 

Other goals were to deliver a new. highly responsive soft- 
ware release process within manufacturing, and to deliver a 
new test process to reduce the cost of testing and the time 
required to diagnose and isolate failing components and as- 
semblies. The goal was to reduce lest and diagnostic- lime to 
less than 50% of the time required for the previous generation 
of PA-RISC workstation products. 

Expedite Standard Processes. The hardware release team used 
an IIP standard process called the Hardware System Prod- 
uct Life Cycle (IISPLC) as the guideline for release of the 
Series 700 from engineering into manufacturing. To achieve 
the lime-to-markel goals, it was critical that time lines be 
managed within the IISPLC to minimize unnecessary delays. 

The key delivery requirement identified as most critical was 
to provide rapid turnaround lime for new prototype hard- 
ware designs and delivery of prcproduction systems. The 
goal wiis lo eliminate any noncritical issues that might delay 
delivery, and to allow the manufacturing team to focus on a 
few critical issues and minimize their impact on the schedule. 

R&D ;uid manufacturing used two-person bring-up learns lo 
facilitate the fastest possible delivery of prototype printed 
circuit assemblies and systems. Each member of the team 
was responsible for supporting the other member by looking 
for ways lo eliminate problems before they happened. 

Printed Circuit Assembly Process 

Developing a high-mix, medium-volume surface mount 
manufacturing process is a challenge. On the technical side, 
equipment selection and installation, line Hows, intercon- 
nect technologies, product mix. and setup must he addressed. 
On the operational side, engineering, technician, and opera- 
tor staffing and training must be considered. Adding the 
variables of a new product and a new technology provides a 
demanding and risky challenge. This was the situation fac- 
ing Hewlett-Packard's Colorado Computer Manufacturing 
Operation (CCMO). 

The Series 700 family of workstations was already under 
development in R&D. The printed circuit assemblies were 
designed ami laid out to meet an aggressive schedule, and 
CCMO was given until October 1990 to prepare to assemble 
prototype boards. 

A cross-functional team was established, since loading a 
surface mount board involves more than just the physical 
assembly. This team consisted of representatives from pro- 
cess development engineering, finance, computer systems, 
materials engineering, purchasing, production control, test, 
quality, and production. The team's first assignment was to 
establish goals and objectives for the new production line. 

A \ision of the surface mount manufacturing process was 
developed, and goals and objectives were defined. The goals 
were to minimize space, inventory, direct labor, and oxer- 
head while maximizing flexibility. Specific, measurable 



objectives were established to compare with the current 
through-hole process metrics in the areas of responsiveness, 
quality, and cost. 

After discussion of process options, differences between 
surface mount and through-hole technologies, and the 
changes required in each of the organizations to accomplish 
these goals, the needs of each area were defined. ( )ther 1 IP 
entities with surface mount manufacturing capability were 
\ isiled to discuss their successes and areas of concern. A 
slralegy was defined for this new surface mount manufac- 
turing process by leveraging the experience of other sites, 
working with the CCMO surface mount learn, and modeling 
the production process, This Strategy allowed us to achieve 
our goals in a rapidly changing technical environment while 
attaining competitive assembly costs. 

Technology 

Over the last six years. Hewlett-Packard's product genera- 
tion process organization has developed, characterized, and 
documented a set of core processes. These consist of one 
single-sided and two double-sided processes, which HP has 
designated SMT-1, SMT-2. and SMT-4. 

Because of performance requirements for the Series 700 
CPU board, the printed circuit board design was not com- 
patible with any of the released processes. The SMT-2 pro- 
cess guidelines were violated by the presence of fine-pilch 
components, over 2100 through-hole leads, and a 0.079-inch- 
pilch through-hole connector. The SMT-1 process guidelines 
did nol accommodate the design requirement for SOJ-26 
SRAMs, tantalum capacitors, and SOT-2:J components on the 
bottom side of the board. 

However, a new process, SMT-2thp, was under development 
by the product generation process organization. The objec- 
tive of SMT-2lhp was to expand the current SMT-2 process 
capability to include thicker boards, more through-hole 
components, fine-pitch surface mount components, and 
finer-pitch through-hole components. SMT-2lhp was consid- 
ered a high-risk development because of the mutually exclu- 
sive objectives Of trying to push solder paste into through- 
holes while applying a minimal amount of paste lo the 
line-pitch pads. CCMO joined the development effort to en- 
sure that SMT-2lhp would be an acceptable manufacturing 
process for the Series 700. 

Production Equipment and Strategy 

As the technology requirements were being defined, the 
equipment procurement process was initiated. Product char- 
acterization was completed through an understanding ofthe 
panel sizes, surface mount versus through-hole component 
mix, surface mount component sizes, and lead pitch require- 
ments. The standard HP surface mount equipment capabili- 
ties were reviewed to determine whether they would meet 
the project goals and objectives for the given product mix. 
The remaining process equipmenl was selected and pur- 
chased to enhance the standard equipment's ability lo meet 
the goals and objectives. 
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The manufacturing process was then modeled to determine 
the optimum production strategy. Many scenarios were ana- 
lyzed to determine the competitive solution. A sensitivity 
analysis was conducted to determine the effects of technol- 
ogy, product mix. and volume changes. Based on this analy- 
sis, continuous flow manufacturing philosophies with a 
family-based product line were adopted to achieve the 
team s flexibility and responsiveness goals. 

High-Quality Printed Circuit Assembly 

The assembly of high-quality products begins in the product 
design phase. Hewlett-Packard has developed and maintains 
standard design guidelines for printed circuit boards as- 
sembled at its manufacturing entities. These design-for- 
manufact urability guidelines assist designers in making de- 
sign/manufacturing trade-offs. They also reduce the risks of 
design related quality defects and simplify the transfer of 
products between HP sites. These guidelines serve as a ve- 
hicle for relaying information betw-een manufacturing and 
the design labs. 

To prepare for the surface mount implementation, the super- 
visors and operators received surface mount training. Twelve 
operators were loaned to the HP Colorado Surface Mount 
Center (CSMC) for hands-on surface mount equipment and 
process t raining. In combination with this core set of trained 
operators, additional operators were used at the facility 
whenever they were required. This minimized costly learning 
curve inefficiencies. 

Supervisor training was initiated by inviting a supervisor on 
the through-hole line to become involved early in the devel- 
opment of the team's process vision, goals, and objectives. 
Process engineering met biweekly with production opera- 
tors to solicit inputs and discuss the surface mount process. 
Both the team and the long-term manufacturing process 
benefited front the expertise and knowledge gained through 
this early training and teamwork. 

Accurate, up-to-date documentation and training arc keys to 
quality manufacturing. The process development engineers 
worked with production and technicians to provide docu- 
mentation to describe the manual assembly processes, ma- 
chine operations, and the flow strategy within the shop. Pro- 
cedures were written for each step in the assembly and test 
process. 

An operator/tecluiician surface mount technical training and 
certification program was then established to qualify opera- 
tors tot each job. Initial operator training was completed by 
process development engineers. This training was based on 
the machine, assembly, test, ESD, safety, and flow strategy 
documentation. Ii included a review of procedures, hands- 
on machine training, and flow training. A workmanship 
training program was added to help the operators recognize, 
report, and repair quality problems. 

Statistical process control charts were kept at each of the 
machines to monitor and maintain process control of the 
key variables. The advantage of using these charts is illus- 
trated by the case of paste viscosity testing. Any batch of 



paste that measured beyond the specification limits was 
discarded. Operators used only batches that measured with- 
in the control limits, thereby preventing potential solder 
defects. 

Operators inspected their own work throughout the process 
and checked for solder defects and other obvious problems. 
In addition to the statistical process control efforts, if an 
operator discovered a problem generated in another area 
these findings were immediately communicated to the 
source. The proximity of the assembly and test areas fos- 
tered this quality feedback loop. This immediate feedback 
along with the minimal levels of work-in-process reduced 
the risk of producing large quantities of defective printed 
circuit boards. This resulted in a reduced defect rate, sim- 
plified repair loops, increased output, and decreased 
throughput times. 

Results 

As a result of the planning, development, and teamwork 
efforts, the responsiveness, quality and, cost goals were 
achieved. Process development was completed in time for 
the product ramp. The space requirements were well within 
the goals, but also had the flexibility to change. 

Compared to the through-hole process metrics, the through- 
put time on the surface mount line was reduced by 83% aid 
t he parts-per-million defect level was reduced by 40%. The 
CCMO team's process goals were achieved. The CCMO 
surface-mount manufacturing process has the ability to 
manufacture high-mix, medium-volume lots of printed 
circuit boards. 

The quality level achieved far exceeded our expectations 
and the defect levels for both solder and component defects 
were well below the goals. This was achieved largely 
through joint design efforts of manufacturing and R&D. 

The manufacture of the Scries 700 boards began on schedule. 
The SMT-2thp process for assembling the Series 700 CPU 
board has been duplicated at I he printed circuit assembly 
facility at the Boblingen Manufacturing Operation (BMO) in 
Germany. 

Software Release Process 

Hewlett-Packard's soft ware and hardware manufacturing 
divisions worked together to improve the software release 
process for the Series 700 workstations. Software Reproduc- 
tion ;ind Distribution Operation (SRDO) and Exeter Comput- 
er Manufacturing Operation (ECMO) established a software 
delivery team to address software release problems. The 
goals were to reduce the turnaround time from the labs to 
shipment for HP-UX* software products and to improve 
communications between the functional organizations. The 
problem was broken down into (wo release processes, one 
for the HP-UX operating system and one for compilers and 
application software products. Each type <>r release required 
a different release model. 
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In the past, the software development laboratories released 
software to manufacturing randomly. Each software product 
was released as soon as the quality goals set for it were 
reached. Manufacturing attempted to prepare these prod- 
ucts for shipment on a first come, first served basis, but crit- 
ical needs sometimes caused sudden changes in priorities, 
Which made it difficult to achieve manufacturing efficiencies. 

The projecl team reviewed numerous options and changed 
the release model front random, unstructured releases to 
structured releases based on product needs. The HP-UX 
operating system release model was allowed to continue 
very much as before, that is, it is based on achieving finality 
goals and releasing as soon as goals are attained. The com- 
piler and application software product release process was 
changed from a first come, first served basis to a batch and 
critical-need release process based on a fixed interval of 
time. Products are accepted from the labs only in batches at 
two-month intervals. Manufacturing can I hen put each batch 
into full production before receiving the next batch. 

Structuring these releases into different models and commu- 
nicating the new release process to the labs allows a stan- 
dard release template for each product timeline. As long as 
the labs meet the timeline, the complete release can be 
shipped at once. In this way, SRDO is able to achieve operat- 
ing efficiencies and therefore reduce the time to market for 
the software products. This new application and Compiler 
software product release model is called DART ( Delivery 
Application Release Tram). 

The project team performed a design-for-manufacturability 
analysis on these products and recommended different 
packaging options to reduce the product cost. The standard 
set of HP-UX manuals shipped with every system was re- 
duced to a set of overview manuals and numerous miscella- 
neous items were eliminated. The team also created a dedi- 
cated manufacturing team at SRDO to manufacture the 
software packages, established communication vehicles 
between divisions, simplified the product structures, and 
delivered Series 700 systems with HP-UX operating system 
software and key application software products. 

The changes to the process have reduced turnaround time 
by 50%, reduced the product cost by 80%. and led to a suc- 
cessful and timely release of the operating system and com- 
piler and application software products on the Series 700 
workstations. 

Lowering Test Costs 

The test process for the Series 700 is highly automated at 
both the board and system level. In particular, the controller 
system for the the Series 700 board level functional test pro- 
vides hands-off. high-throughput testing as well as detailed 
data collection and tracking. The board test system also 
automated the socket board test for the three major ASICs 
I application-specific integrated circuits ) during the produc- 
tion ramp. It is a relatively inexpensive means of providing 
quick, accurate feedback on line returns and supplying 
production with reliable components. 



One of the primary objectives in the design of the board test 
system was to support automated functional testing and 
reliable data collection with throughput quantities of four 
times the forccasl at no additional cost. The ability of the 
board lest system to support four times the forecast has 
allowed CC'MO to ship virtually without delay when material 
problems demanded meeting commitments with sporadic, 
very high-volume builds. To improve throughput, the board 
test system also attempts automatic failure isolation to the 
failing component when possible and includes the suggested 
repair. Tin- board test system features such as voltage mar- 
gining, modular test applications, and automatic isolation, 
along with testability design hooks in the the Series 700 
hardware and firmware, have allowed CCMO to meet cost 
and quality objectives for the Series 700. 

Another important objective of the board test system was 
ease of use (no training required) by conforming to emerg- 
ing user interface standards. Because the schedule for the 
new user interface standard did not coincide with the Series 
700 schedule, it could not be used. The board test system 
user interface, however, is quite user-friendly and meets the 
ease-of-use objective. The new interface standard can easily 
be employed when completed. 

Data Access for Repair 

The data collection and reporting capabilities of the board 
test system facilitate rapid improvement of both component 
reliability and the process itself. The board test system 
tracks every board tested. It records every test run and the 
corresponding output for boards that fail. The data for all 
boards shipped is archived monthly, allowing easy access to 
the history of any particular part number. In addition, the 
board test system manages a repair queue and automatically 
assimilates the data from the functional test and repair pro- 
cess into useful, readable summaries. 

The board test system is also the server for the technician 
repair system. Here technicians trotibleshoot board failures 
that the board test system couldn't automatically isolate. 
The technician repair system provides access to all of the 
board test system's data so that technicians can quickly see 
the history of every board needing repair. Repair entry fea- 
tures and several other aids are available to facilitate trou- 
bleshooting and repair of the Series 700 boards. Both the 
board test system and the technician repair system make 
batch reports nightly. 

System level testing is automated by the system test control- 
ler. The configuration and status of each system are tracked 
all the way through the process. This controller is linked to 
the data tracking system and quickly identifies the unit 
needing replacement when a failure occurs. 

The high level of test automation at both board and system 
lev els is a key factor in the manufacturing of the Series 700 
workstations. It pro\ides the ability to ship high-quality 
workstations at quantities well in excess of forecasts when 
material problems make it necessary. In addition, the easy 
access to detailed data facilitates the repair of precious 
printed circuit assemblies. 
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Quality 

Quality was a primary concern throughout the development 
and prototyping of the first Series 700 products. Product 
quality was never compromised. The Series 700 team set 
Forth a rigorous collection of qualification and production 
processes to ensure that Series 7(H) products would be of 
the highest delivered quality. The key items that were put in 
place include: 

• A rigorous product qualification lest 

• A rigorous board and system production test process 

• An oven aging process for early prototype and early 
production units 

• A commitment to developing a highly reliable production 
process in preparation for volume shipments. 

The quality organization and the R&D lab undertook the 
product qualification process, while Uie new product engi- 
neering teams in manufacturing undertook the task of devel- 
oping the required production processes. After designs are 
verified, strife (stressed life) and environmental tests are 
used to identify latent defects in product hardware. Systems 
are tested under varying conditions: temperature, humidity, 
shock, vibration, and altitude. These factors accelerate nor- 
mal product stress and determine whether or not the system 
meets design specifications. When the qualification has been 
completed, one can be fairly certain that the product will 
operate properly in all required customer environments, and 
will withstand normal use and abuse. 

Test Processes 

At board test, a series of offline diagnostics and online exer- 
cisers were run during early production to ensure thai com- 
ponent or process defects were delected as quickly as pos- 
sible. As the production process improved and component 
problems were isolated and corrected, tests were removed 
lo reduce the cosl and lime required to produce the product 
(see Fig. 2). 

System test is broken inlo four sections: integration, oven 
age. run-in. and posl. At integration, assembled systems are 



connected to keyboards and monitors and offline diagnos- 
tics are run. Integration testing is done to verify that the 
system is assembled properly and that there are no major 
problems with system components. Next, systems move to 
oven age. where they undergo temperature cycling while 
running offline diagnostics, ( hen age testing is done to 
stress system components and determine if there are any 
temperature sensitive components or process defects. Sys- 
tems then move to run-in. where they are power cycled and 
offline diagnostics and online exercisers are run. The length 
of time spent in run-in depends on product quality As the 
quality of the products and the process improves, run-in test 
time can be reduced. Finally, systems move to post test, 
where monitor and keyboard functionality are checked one 
last lime before boxing the system. At post test, an operator 
boots the operating system and runs a quick graphics test 
program. 

The oven age process was designed to run on early proto- 
type systems and to be eliminated before 4 entering full pro- 
duction. For all Series 700 products this has been accom- 
plished. The earl\ prototypes were run through a series of 
one-hour temperature soaks at 0°C and 55°C. With each new 
introduction, the oven age process identified process, com- 
ponent, and design flaws early in the process, and before 
systems were shipped. This early detection of faults was 
instrumental in holding to schedule. Soon after flaws were 
corrected, the oven age process was no longer needed and 
was removed. The process is still used to qualify new com- 
ponents or design changes, but has no! been necessary lo 
maintain product quality. 

With volume readiness being the focus, processes need to be 
able to support the required flow of product, and product 
and process quality levels need to be at or near established 
goals lo maintain the production process cosl effectively. 
Series 700 process readiness and quality were in place at 

introduction to support a volume ramp. The best testimony 

to volume readiness came in October [091, when nearly 
5000 Model 720 and 7:10 workstations were delivered in less 
I ban tour weeks. 
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High-Performance Designs for the 
Low-Cost PA-RISC Desktop 



This paper presents the processor, memory, graphics, multimedia, and 
built-in core I/O design of the new HP 9000 Models 705 and 710 
entry-level, scalable, PA-RISC workstations, The use of a buffered 
CPU/memory interconnect is important for scaling the high-frequency, 
high-performance processor design to the entry-level desktop. 

by Craig R. Friiik. Robert J. Hammond. John A. Dykstal. and Don C. Soltis, Jr. 



Tin- HP 900(1 Models 705 and 710 (Fig. 1 ) are new, entry-level 
members of tlie Series 700 performance workstation family. 
They complement Hie higher-performance Models 720. 730, 
and 750 Workstations (see article, page (3). offering lower 
cost and broad functionality. The Series 700 workstation 
products are based on Hewlett-Packard's PA-RISC' architec- 
ture, 1 providing application-code compatibility across a 
broad range of applications. The systems address applica- 
tions ranging from those characterized by cost sensitivity, 
diskless workstations, and grayscale graphics to those 
requiring higher-performance 3D color and maximum 
computational ability. 

The 35-MHz Model 705 is designed for desktop applications 
that can benefit from 35-MIPS. 8.4-MFI.OPS, and 34-SPEC- 
mark performance at the lowesl cost.* The highly integrated 



graphics system design produces graphical user interface 
((il l) performance exceeding 4800 (geometric mean) on the 
XHperf benchmark and 450,000 2D XI 1 vectors per second. 

Cost-sensitive applications requiring even greater perfor- 
mance will benefit from the 50-MH/. Model 710 color work- 
station. It produces 59-MIPS. 12-MFLOPS. 49-SPECmark 
performance, lis GUI performance exceeds 7290 (geometric 
mean) on the XHperf benchmark and 500,000 2D Xll vectors 
per second. This makes it an ideal platform for low-cost. 2D 
electrical and mechanical computer-aided design. 

MIPS stands lor million instructions per second MFL0PS stands for million floating-point 
operations per second Tne SPEC benchmark suite is a set of benchmark routines representing 
realistic tasks the SPECmark number is the geometric mean of the SPECtatios lor all Dl the 
routines Trie SPECratio is obtained by dividing a reference machine's time to complete a 
routine by the test machine's time The reference machine is a VAX 1 1/780 computer 




FiK- l.TheHP90Q0 Models 705 
(right ) urui 710 (Mi ami center) 
Workstations are 35-MHz and 
60 j MHz entry-level members of 

the IIP INK ID Series 7(10 family 
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The Model 705/710 design produces an Unparalleled level of 
desktop price, performance, and functionality in a quick- 
timc-to-market design. Through the integration of system 
components and architecture awl the leveraging of other 
Series 700 designs, the Model 710 was ready to ship in less 
I nan 12 months from die inilial product concept Both models 
support up io 64M bytes of memory, an 8-kHz-sample-rate 
voice-quality audio design, and buill-in I/O interfaces includ- 
ing intelligent, 32-bit, DMA-capable SCSI II and IEEE 802.3 
LAN. Both models also support both grayscale and color 
configurations. 

This paper details aspects of the Model 705 and 710 design 
that were Important in producing a single, reliable, low-cost, 
high-performance, grayscale and color desktop system de- 
sign on an aggressive 12-month schedule. It also compares 
the processor and graphics performance of the Models 705 
and 710 with respect to other desktop members of the 
Series 700 product family. 

System Overview 

The Models 705 and 710 are designed to meet the need for 
low-cost desktop systems while providing scalability to 
higher performance. A single system board design enables 
the Model 705 to offer a very low price, while frequency 
scalability enables the Model 710 to reach higher perfor- 
mance levels at approximately twice the price. The Model 
705 is able lo use lower-frequency components — PA-RISC 
CPU. floating-point coprocessor, and cache SRAMs — to 
meet lower-cost objectives. The Model 710 uses higher- 
frequency components to reach higher performance levels. 

The single system board design for the Models 705 and 710 
is shown in Fig. 2. It fits within a desktop package 14 inches 
deep by 16 inches wide by 3 inches high, with the system 
electronics integrated on a single printed circuit board. The 
package accommodates two 3.5-inch storage devices with 
fixed or removable media. To better accommodate multi- 
media applications, the package integrates a CD-ROM player 
with no increase in desktop area. 



The Model 705 provides the follow ing features: 

• 35-MHz PA-RISC CPU 

• 35-MHz PA-RISC 64-bil floating-point coprocessor (FPU) 

• 32K-byte instruction cache 

• 64K-byte copyback data cache, 04 bits wide 

• 5-megabyte-per-second SCSI II interface (NCR53C700) 

• 10-mcgabit-per-second IEEE 802.3 LAN interface ( Intel 
82596 with built-in ThinLan transceiver and Al 'I ) 

• 8M to 04M bytes of error correcting SIMM memory 
(8M bytes in diskless configuration only) 

• Integrated 8-kHz, 8-bit, voice-quality audio 

• 8-plane 1280-by- 1024-pixel color, 8-plane 1024-by- 768-pixel 
color, or 8-plane 1280-by- 1024-pixel grayscale graphics 

• Two integrated 3.5-inch 420- or 840-Mbyte fixed disk drives 

• One of the following three removable media devices ( uses a 
fixed drive position): CD-ROM. 3.5-inch PC-compatible flex- 
ible drive, or 3.5-inch 2.0-Gbyte DDS (digital data storage) 
tape drive. 

• Two RS-232 modem-control serial ports 

• 350-kbyte/s bidirectional Centronics port 

• HP-HIL device interface for a keyboard and up to 6 other 
possible pointing or input devices. 

The Model 710 enhances Model 705 performance and 
functionality with the following differences: 

• 50-MHz PA-RISC CPU 

• 50-MHz PA-RISC 64-bit floating-point coprocessor. 
Processor Design 

The processor consists of a PA-RISC CPU,- a PA-RISC 
floating-point coprocessor, ' and separate instruction and 
data caches. It executes one instruction (integer or floating- 
point ) on every clock cycle. The instruction cache and 64-bit 
data cache are accessed simultaneously, producing an exe- 
culion rate exceeding 57 MIPS and 12 double-precision 
M FLO PS in the Model 710, and 35 MIPS and 8 double- 
precision MFLOPS in the Model 705. 
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Fig. 2. Uluck diagram of the system 
board of the HP 90(10 Models 705 
and 710 workstations. 
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The instruction and daia caches are separate, direct- 
mapped, virtually indexed, fast memories, each of which is 
capable of supporting an access every processor clock 
cycle. The instruction cache is 32 bits wide and 32K bytes in 
size, and is organized as 1024 lines of 32 bytes each. The 
data cache is similar in design, but is 64 bits wide and (i4K 
bytes in size, and is organized as 2048 lines of 32 bytes each. 
The data cache also enforces a copyback memory update 
policy. 

The CPU Interfaces with the memory anil system bus 
through the PBus. an interconnect defined for the PA-RISC 
CPU. This interface operates synchronously with respect 
to the CP!" and supports memory data transfer rates up to 
200 megabytes per second and I/O transfer rales up to 100 
megabytes per second. The small signal count — 53 signals — 
allows the design to interface easily with standard, low-cost, 
plastic quad flat package ( PQFP) ASK s i applicai ion-specific 
integrated circuits). 

Memory and System Bus Interface 

A standard method used lo conned the CPU. memory, and 
I/O subsystem in desktop systems involves the use of a 
three-stale bus. A bus arrangement often Suffers from the 
fact that a significant number of attachment points cause a 
large capacitivc load — and I he resulting low impedance — to 
be seen by a driving device. This r esults in whal has been 
termed the bus driving problem. 4 It can often lead to slow 
operation in systems relying on standard low-cost TTL or 
CMOS interfaces. 

In the Model 705/710 design, lite CPU is connected to the 
memory and I/O system components through the memory 
and System bus interface. The memory and system bus inter- 
face implements a small buffered interconnect between the 
CPU. the memory, and the I/O system bus. It is the active 
c(|iii\ alenl of the shared signal bus. in that it connects the 
CPU, memory, and [/< ) subsystems. However, it lias an ad- 
vantage in that the separate subsystem interfaces are con- 
nected In a poinl-lo-poinl fashion. In the Model 710 design, 
the isolated interconnect and resulting transmission line 
characteristics make ii possible to operate the ( PU and its 
PBus interconnect at 50 MHz using standard 8-mA TTL' 
compatible CMOS buffers. 

Fig. 3 shows the memory and system bus interface and its 
connections td the CPU PBus. the memory controller ports, 
and the system bus. In the Model 710, the CPU PBus and the 
system bus interconnect operate at 50 MHz and 25 MHz, 
respectively. In Hie Model 705, the CPU PBus and the system 
bus interconnect operate at 35 MHz and 17.5 MHz, respec- 
tively. The memory and system bus interface is designed lo 
exploit the capability of the CPU PBus through the use of 
two interleaved memory controller data ports, each having 
up to 100-Mbyte/s capability. The memory and system bus 
interface also provides the interface lo memory for all l/< )- 
initialed DMA transactions. 

In an effort to reduce the size. cost, and power consumption 
of the design, logic normally found separate in the I/t > con- 
troller interface is Integrated into the memory and system 
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Fig. 3. M'iiiun and system tins interface interconnect ports. 

bus interface. This includes byte accumulation and replica- 
tion functions used in conjunction with the I/O controller to 
allow byte addressing of 8-bil-wide devices over the 32-bit 
system bus. The memory and system bus interface also con- 
tains logic to improve the performance of the CPU and ease 
the implementation of PBus interlace protocol. This is done 
by including memory eopyout and copyin buffers, and buff- 
ering the system bus interface to pipeline I/O and graphics 
write operations. 

Fig. 4 shows the important blocks and data paths of the 
memory and system bus interface design. 

The Memory and system bus interface is implemented by 
two 160-pin, standard-cell ASICs. The ASICs are identical in 
design, fabricated in Hewlett-Packard's low-cost. 1-um, 
t \H )S34 process, and packaged in low-cost I'QFPs. The 
memory and system bus interface ASICs Operate from a 
single clock, creating a synchronous interface to the CPU. 
memory, and system bus. The memory and system bus inter- 
face operates at 35 MHz and 50 MHz. respectively, in the 
Models 705 and 710. 

The memory and system bus interface is implemented with 
two smaller, plastic-packaged ASICs instead of one larger, 
more expensively packaged ASIC because of the need to 
Until total system cosl. including ASIC processing and pack- 
aging cost, prinled circuit board assembly cost, and syslem 
cooling cosl. Distributing the power and signal I/O between 
two chips instead of one allowed the design to use HP's, 
low-cost, quick-t ime-to-market. ( M< )S34 ASIC process. This 
resulted in a small, low-cost, silicon die wilh an I/O pad ring 
that is evenly matched to the active area and capable of being 
inexpensively packaged in a surface mount plastic package. 
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Table 1 provides an overview of the MP CM( >S34 process 
parameters used in the memory and system bus interface 
implementation. 

Table I 

HP CM0S34 Process Parameters 



N-channel FET L,, rr 
N-channel FET L,| R1W „ 
P-channel FET L,, fr 
P-channel FET tldmwn 



Metal I 
Metal II 



0.85 urn 
1.0 urn 
0.85 um 
1.0 urn 

Interconnect Pitch 
Contacted Uncontacted 



3.8 um 
3.8 um 



3.0 um 
3.0 um 



Table II shows some of the resulting specifications for the 
memory and system bus interface ASICs. 



Table II 

Memory and System Bus Interface Chip Specifications 



Die Size 

Total Used Gate ( 'ount 

Pads 

Package 

Frequency 

Operating Temperature 

(junction) 
Operating Voltage 



0.7 by 0.7 cm 

18.000 (approximately) 

160 

100-pin PQFP 
dc to 50 MHz 

I1M) C 

4.5 to 5.5V 



Copyout Buffer. Tin' memory and system bus interface incor- 
porates memory copyout huffers to mitigate the effects of 
having "dirty" or modified data (resulting from the copyback 
cache architecture) written back to memory on a data cache 
miss. By buffering copyback data until the completion of the 
read miss, the read miss latency observed by the CPU is 
reduced significantly. This extra level of buffering is impor- 
tant, even for a low-end workstation, considering the proba- 
bility that., on average, half of the data lines written to 
memory contain dirty data. 5 

Copyin Buffer. The memory and system bus interface also 
incorporates separate copyin buffers for each of the mem- 
ory' bank Controllers. The memory subsystem is divided into 
two independent memory banks. These buffers are used to 
resolve asynchronous behaviors inherent in the design of 
the mtiltibank memory implementation during memory read 
operations. If one of the memory banks is busy during a 
memory read, then the data front the other bank is buffered 
in the copyin buffer until the busy bank responds. At thai 
time, data is transferred to the CPU. 

1/0 Write Buffer. The memory and system bus interface imple- 
ments a pipelined, separate address and data, system bus. It 
is designed for the efficient transfer of randomly addressed 
data values to and from I/O and graphics devices. The pipe- 
lined nature of the bus allows a randomly addressed 32-bit 
data value to be transferred on every 25-MIIz system bus 
clock cycle. 

In support of the bandwidth potential of the system bus. an 
I/O write buffer is used within the memory and system bus 
interface to overlap CPU PBus transactions with system bus 
transactions. This makes it possible for the CPU to write to 
I/O and graphics devices at its maximum rate of 50 Mbytes/s. 
This is an important feature, considering that without it. 
X Window display performance would have been degraded 
significantly. 

1/0 Byte Accumulator/Replicator. The built-in UO controller 
used in the Models 705 and 710 was designed for use in the 
HP 9000 Models 720. 730. and 750 systems. As part of the 
bus interface in those systems, the I/O controller manipu- 
lates latch-based data bus transceivers for the purpose of 
interfacing a 32-bit byte-aligned bus to 8-bit wide devices. 
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such as PROM. In the Models 705 and 710 design, the latch- 
based data bus transceivers controlled by the I/O controller 
are an integral part of the memory and system bus interface 
design. This results in a substantial cost, power, and board 
area savings for the system. 

System Clock Design 

The Model 705-710 design relies on standard, low -power, 
low-cost, TTL clock circuitry to produce well-defined, 
controlled-skew clock signals for the CPU, memory and sys- 
tem bus interface, memory, graphics, and I/O controller. In 
the Model 705/710 design, fixed-relationship clock signals of 
17.5 and 25 MHz and 35 and 50 MHz. respectively, are pro- 
duced for all of the system devices. The low -power, low-cost 
design uses minimal printed circuit board area compared to 
alternative implementations relying on ECL logic. 

CPU Clock. The Model 705/710 clock circuitry is designed to 
drive a combination of ECL and TI"L clock input structures. 
The PA-RISC CPU has a differential ECL clock receiver. The 
higher-performance Models 720, 730, and 750 designs imple- 
ment this clock circuitry using differential ECL logic (see 
article, page 23). This design provides precise dock skew 
control ( controlled, predictable, edge rates and small propa- 
gation delay variation ) and allows these models to operate 
at higher frequencies than the Model 710. However, these 
ECL designs tend toward high cost, power consumption, 
and printed circuit hoard utilization. 

The Model 705/710 design takes advantage of the differential 
input characteristics of the CPU dock receiver lo produce a 
single-ended TTL dock signal instead of a differential ECL 
clock pair. This TTL clock signal is generated by the system 
dock circuitry, delayed appropriately to meet the timing 
requirements of the Cl'l I and memory and system bus inter- 
face designs, and then driven lo one of the differential inputs 
of the CPU, which are terminated at the CPU. The termina- 
tions are determined by the transmission line characteristics 
and the limits imposed on the voltage swing of the clock 
signal. The voltage swing is limited to produce predictable 
rise and fall characteristics, This makes the threshold cross- 
ing more predictable with respect to the other differential 
input. The second differential CPU dock input is fixed by a 
voltage-compensated source at a voltage representative of 
the midpoint of the TTL dock signal swing, thereby produc- 
ing a symmetrical clock signal within the CPU Clock sym- 
metry is important for higher-frequency systems because the 
PA-RISC CPL' uses both rising and falling edges. 

Delay Elements. Many system characteristics interact to es- 
tablish the range of frequencies over which a synchronous 
design can Operate. These include ASIC device characteris- 
tics, interconnect topology, and the design of the system 
dock circuitry. The Model 705/710 clock design relies on the 
characteristics of inexpensive T\'\. clock driver components 
lo produce predictable output -to-oulpiil skew and delay. 
This establishes a fixed basic clock relationship. However, 
even with this basis established, it is often necessary lo ad- 
just each dock edge independently, This allows each 
clocked Interface to be tuned to its maximum Interconnect 



frequency, resulting in the optimal operating frequency for 
the system. 

The Model 705 710 design employes discrete delay elements 
to adjust each clock independently to an optimal intercon- 
nect relationship. The delay elements used in the Model 
705/710 design are implemented with discrete components 
as low-pass filters or transmission line segments. Two ca- 
pacitors and an inductor make up each delay element The 
values are chosen to preserve rise and fall characteristics. 
The design produces inherently predictable delay values 
with controlled edge and line matching impedance charac- 
teristics. Clock signals requiring longer delay values use 
multiple concatenated segments. The design is flexible, uses 
readily available components, and uses little printed circuit 
board area. 

Memorj Design 

The goal for the Model 705/710 memory design was to pro- 
vide a large, reliable, system memoiy that would produce 
leading-edge performance at very low cost. The design su|>- 
ports from SM to 04M bytes of memory using 72-pin com- 
mercially available single inline memory module (SIMM) 
cards. The memory design takes advantage of the CPU's 
copy -back cache architecture (CPU-initiated memory opera- 
tions are alw ays 32-byte cache blocks) to reduce the com- 
plexity and cost of implementing memory error detection 
and correction. 

Memory Error Correcting Code. The Model 705/710 memory 
design calculates the error correcting code on a 64-bit data 
w ord to minimize I he number of DRAMs needed in the de- 
sign. This results in sigriificanl cost savings compared lo an 
implementation calculating the error correcting code on a 
32-bit word. The 04-bil error correcting code requires only 8 
bits of DRAM while the 32-bit error correcting code requires 
11 hits of error correcting DRAM for the same amount Of 
protected memory. Additionally, compared to simple parity 
checking designs used in other desktop systems, the 04-bit 
error del eel ion and correction achieves the same DRAM 
cost, bill with much greater reliability. The Model 705/710 
design COrreCt9 all single-bit errors Oil the fly and delects all 
double-bil errors. Detection of all triple-hil and quadruple- 
bt1 errors in any adjacent 4 bits makes il possible lo delect a 
single failing 1-bit wide DRAM. 

Memory Bank Controller. The uieiuorv design cnnsisis of iw<> 

independent memory bank com rollers. Each memory bank 

controller supports I DRAM SIMMs for a system total of 8 
SIMMs. Two SIMMs are required per controller lo form a 
64-bit error detection and correction rnemoiy word. The 
combination of two memory bank controllers forms a 
128-bit memory word and has a cycle lime of SO ns (or 200 
megabytes per second) in I he Model 710 design. The mem- 
ory and system bus interface acls as I he ceniral memory 
controller, working in conjunction with the two memoiy 
bank controllers lo assemble memorj' data into CPU cache 
blocks during reads and distribute data to the bank 
controllers during memory writes. 
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The memory ami system hits interface interleaves the 
memory bank controllers on 4-bvte boundaries, producing 
even and odd memoty hanks. This interleaving makes it pos- 
sible lo overlap sequentially addressed 32-bit memory words 
by directing Ihem to alternate memory hank controllers. In 
this way, read-modify-write behavior resulting from a 32-bil 
I ).MA write (the error correcting code is calculated on a 
64-bit memory word) is hidden. This element of the design 
prov ides the necessary DMA memory bandwidth along with 
low -cost error detection and correction. 

Fig. 5 is a block diagram of a memoty hank controller. Ii 
consists of a DRAM controller ASIC, an error detection and 
correction data path ASIC, and SIMM memory. The hank 
controller receives its address information from the system 
address bus, while control is directed by the memory and 
system bus interface. Timing and control lo address, access, 
and refresh locations in the memory hanks are provided by 
I he DRAM controller ASIC. The detection and correction of 
errors and Ihe tunneling of R4 bits of DRAM data and 8 bits 
of error correcting code Into 32 hits are provided by ihe error 
del eel ion and correction data path ASIC. F.aeh ASIC is a 
low-cost HP CMGS34 circuit packaged in a 160-pin PQFR 

Graphics Design 

hi addition lo ("IT performance, graphical user inl efface 
(GUI) performance is often important in characterizing ap- 
plication throughput. Todays desktop applications are built 
on standard window environments like Ihe X Window Sys- 
tem to increase their ease of use. to provide a common look 
and feel to the user, and to allow greater application porta- 
bility, Providing high lev els of X Window performance re- 
quires a balanced system design characterized by high levels 
of integer and floating-point performance, efficient virtual 
memory supporl, fasl memory-lo-franie-btiffer data trans- 
fers, and a fast frame buffer. The Models 705 and 710 sup- 
port this balance by moving much of the graphics pipeline 
into Ihe CPU so that graphics performance gains by leverag- 
ing CPU performance. ( h erall system cost is decreased by 
eliminating redundant hardware." 

Model 705/710 graphics support consists of eight planes, a 
cursor, and a 256-enlry color look-up table. Three configura- 
tions are available: 12S0-by- 102 -1-pixel color or grayscale and 
1024-by-7l>8- pixel color. The Models 705 and 710 use the 
same graphics architecture as the rest of HP's Series 700 
family. The design also leverages significant amounts of 
hardware and software from the Models 720, 730, and 750. 

Fig. 6 shows the Model 705/710 graphics subsystem. There 
are four blocks of dedicated graphics hardware: the system 
bus interface. Ihe frame buffer controller, the frame buffer 
memory, and the video data path. 



System Bus Interface. The Model 705/710 graphics system bus 
interlace resides in ii single gate array on the system board. 
This ASIC accepts commands from the system bus. It gener- 
ates commands for two independent buses. One is a high- 
speed interface to the frame buffer controller, while Ihe other 
is an auxiliary bus lo the video data path. The high-speed 
interface between Ihe system bus interface and Ihe frame 
buffer controller follows a typical sequence. First, the system 
bus interface defines a data mode, which tells the frame 
buffer controller how all subsequent data is packed on the 
bus. For example, one data mode sends four 8-bit pixels 
down the 32-bil bus. Next, the system bus interface sends a 
beginning frame buffer address. Finally, a sequence of data 
packets follows, accompanied by an address modification 
rule. The frame buffer controller uses this address modifica- 
tion cufe to synthesize the addresses of subsequent pixels. 

The system bus interface provides a second important func- 
tion—it contains hardware for vector, rectangle fill, and 
block move command generation. Soil ware sets up the 
hardware with information about Ihe particular operation, 
and Ihe hardware then synthesizes the necessary commands 
for Ihe frame buffer controller. X Window library calls take 
advantage of this drawing support when possible, with soft- 
ware implementing the balance of the commands. 2D and 
3D wireframe applications are segmented so thai Ihe CPI 
performs display file and geometry processing and the system 
bus interface performs vector generation. 1 ' 

Frame Buffer Controller and Memory. The Models 705 and 710 
implement the frame buffer controller in a second gale array 
on the system board. The frame buffer resides in twelve 
lM-bit VRAM (dual-ported video DRAM) chips. This pro- 
v nles storage space for 128(1 • 102 1 8-bil pixels as well as 
offscreen memory for a shadow color look-up table and Ihe 
cursor bitmap. 

The frame buffer controller accepts commands from the 
system bus interface and builds six-pixel tiles to access the 
frame buffer. This means that every frame buffer memory 
cycle can read or write up lo six pixels in parallel, A single 
memory controller provides Ihe protocol for all of the frame 
buffer VRAM. The frame buffer conl roller organizes Ihe 
memory so that each VRAM row contains I wo scan lines of 
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information. With increasing rectangle fill and block move 
sizes, the frame buffer controller becomes quite efficient 
because it fully packs the six-pixel tiles, and it maintains 
memory i>age mode. 

A VRAM-based design provides several advantages for the 
graphics subsystem. VRAM has two independent and 
asynchronous ports: a random i>ort and a serial port. The 
frame buffer controller reads and writes the memory array 
through the random port. At the same time, the video data 
path reads the serial port for screen refresh. This simplifies 
the design of the frame buffer controller because the frame 
buffer controller no longer has 10 arbitrate between random 
port reads and writes and screen refresh serial reads. Also, 
this isolation of screen refresh ac cesses increases random 
port bandwidth between the frame buffer controller and the 
frame buffer. The frame buffer controller takes advantage of 
other features of the VRAMs to increase the frame buffer fill 
rate. 

Video Data Path. The video data path consists of two 1( s. a 
third gate array (video data path multiplexer) and a Brook- 
tree Bt458 RAMDAC. The video data path is the output end 
of the graphics subsystem, and its primary responsibilities 
are screen refresh and look-up table support. 

The gate array receives data from the serial ports of I he 
frame buffer VRAMs. and receives contri il from the frame 
buffet controller. Ii accepts six pixels in parallel and serial- 
izes the data to four pixels in parallel before sending it to 
the RAMDAC. This gale array also detects when the cursor 
should be displayed and shifts this data out in place of 
regular image data. 

The RAMDAC contains the 25(i-eniry look-up table. It also 
performs I he final serialization of the frame buffer data, and 
converts the resulting full-speed pixel stream lo analog 
signals lhal drive the monitor. 

Audio Design 

Through Ihe inlegralion of audio and video capabilities on 
I he desklop. miillimedia applications Can provide important 
ways to improve user productivity. All hough many possibili- 
ties, exist for the application of Integrated v ideo mid CD au- 
dio, studies indicate I hat voice-quality audio is ihe single 
most Important capability needed for improving user pro- 
ductivity. For this reason, the Models 7(15 and 71(1 include an 
inexpensive voice-quality audio design as pari of the base 
product, instead of more Complex and cosily audio and 
video capabilities. 

The Models 705 and 710 are designed with ihe built-in capa- 
bility to record and play back voice-qualily sound, w here 
voice-quality means an 8- kHz sample rale and 8-bit or l(i-bit 
samples. An internal speaker, a headphone conneclor, and a 
microphone connector are prov ided as audio destinations 
and SOUtCes. The audio design stresses low cost, while pro- 
viding 'be fuiuiionality necessary for voice-qualily audio 
applications. These applications include voice annotation, 
voice notification, computer-based I raining, and emerging 
application areas such as verbal im recording and audio 
command and control. 

Audio Software Support. The amlto subsystem is designed to 
work within a client/server configuration w ith Ihe client 



running either reniotelv or locally. Software support for au- 
dio application developers Ls provided as a standard part of 
HP-CX* operating system release 8.07. and consists of device 
drivers, an audio server, an audio data type conversion library 
(CUfib). an audio library (Alib). an audio toolkit, and a widget 
set. The audio library architecture is similar to Xlib in the 
X Window System; this aids in the integration of audio into 
X windows applications. 

The audio library provides the application program interface 

10 the audio application. An application can manipulate the 
characteristics of the audio device, prepare an audio file to 
be played, and lell ihe audio device lo play the audio file via 
calls provided in the audio library. Similarly, an application 
can record audio data into an audio file. Audio dala streams 
and raw audio data are supported by Ihe audio library, which 
allows an application to play or record without having data 
files. An audio toolkit and audio widgets are designed to 
work well w ithin ihe X toolkit and widgets environments. 

Audio Hardware Design. The low-cost nature ot the Model 
705/710 audio design led lo the use of a telecommunications 
CODEC (coder/decoder) with S-bit u-law, 8-bit A-law. and 
16-bii linear sigma-della conversion, software-programmable 
gains, filtering and output destination selection, and ring and 
lone generators. The CODEC provides the analog-to-digital 
and digital-to-analog front end of a lelcphone/iuodem/fax 
connection w ith sufficient qualily for voice recording and 
playback. Fig. 7 shows a diagram of Ihe Model 705/710 audio 
subsystem. 

The two 8-bit conversion algorithms I u-law and A-law) are 
voice compression standards for North America and Europe, 
respectively. Both compress a l-'t-bil Sample into an 8-hil 
sample without losing significant voice-content information. 
A 16-bii mode can be used for future modem/fax front ends 
The converter itself is of a dclta-sigma design, which makes 

11 possible to use inexpensive single-pole reconstruction and 
anti-aliasing filters rather than more precise filters which are 
more expensive in both cost and printed circuit board area. 
Additional anti-aliasing filleting is done w ith a built-in digital 
signal processor lhal also prov ides separate receive and 
transmit gain or attenuation and some frequency equalization. 

The ( '< »DE< ' is connected lo the processor through a 128 
byte transmit FIFO buffer, a 128-byte receive FIFO buffer, 
and a 16-byte command FIF< > buffer. The transmit (out lo 
speaker or headphones) FIF< ) interrupts Ihe processor 
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whenever the FIFO is less than half full so thai processor 
interruption frequency is minimized. Similarly, the receive 
(data in from microphone) FIFO inlemipts the processor 
when the FIFO is more than half full. At the S-kHz sample 
rate, in 8-bit mode, the processor has 8 ms to respond to the 
interrupts before samples are dropped. The operation of 
these FIFOs can be fully overlapped with the command 
FIFO, which is used to send control information to the 
CODEC to set gains, filter coefficients, output destinations, 
and so on. 

Input/Output 

To minimize the overhead typically associated with manag- 
ing less sophisticated interface controllers, the Model 
705/710 design incorporates intelligent controllers for the 
SCSI II and LAN functions. These controllers access mem- 
ory using efficient 32-bit DMA interfaces to reduce data 
latency and minimize system bus and memory use. 

The SCSI controller used in the Model 705/710 is the 
NCR53C700. This single-chip controller manages the single- 
ended SCSI II bus protocol and provides an intelligent inter- 
face to system software. A 32-bit DMA engine, coupled with 
an internal 32 -byte FIFO, is used for efficiently moving data 
to and from the 5-Mbyte/s SCSI bus and main memory. 

The LAN controller used in the Model 705/710 is the Intel 
82596DX. This single-chip controller, along with an Intel 
82C501 encoder/decoder chip, implements the LAN auxiliary 
unit interface (AIT). A transceiver is used for the ThinNet 
BNC connection. The controller manages the L\N protocol 
and provides an intelligent interface to system software. A 
32-bit DMA engine, coupled with internal 128-byte receive 
and 64-byte transmit FIFOs, produces an efficient interface 
to system memory. 

The Model 705/710 keyboard and up to six other IIP-HIL 
devices connect to the system through the IIP-HIL interface. 
The design also makes available two full modetu-conlrol 
RS-232 ports. These ports are implemented with 9-pin D-sub 
connectors. For high-speed parallel communication, the 
design implements a bidirectional version of the Centronics 
interface, allowing for the connection of input devices such 
as the HP ScanJet seamier. 



Performance 

The "true" performance of a workstation from any particular 
user's perspective depends heavily on the application work- 
load to which the system is exposed. For the purposes of 
this paper, we depend on certain benchmarks that in many 
cases approximate the characteristics of typical application 
workloads. We have chosen the SPEC and Dhrystone bench- 
marks to characterize the processor and memory perfor- 
mance of the system, and the XII pert benchmark and some 
peak vector performance numbers to characterize the 
graphics performance. We have selected I l ie desktop HP 
9000 Model 720 and 730 workstations for our processor and 
graphics performance Comparisons because of their desktop 
performance leadership. 

A comparison of the processor performance of the Models 
705 and 710 and other desktop members of the Series 700 
workstation family is shown in Fig. 8. 

Fig. 9 shows the relative XI 1 and 2D/3D vector graphics 
performance of the Model 710 and other desktop Series 700 
family members. 

Conclusion 

The HP 9000 Model 70S is an entry-level, low-cost, desk- 
top color workstation providing high levels of application 
performance. The system offers attractive levels of price/ 
performance for cost -sensitive users and applications. The 
design produces a balance of integer, floating-point, mem- 
ory, graphics, audio, and I/O capabilities at an affordable 
desktop price. The Model 710 is a desktop color workstation 
that prov ides very high levels of performance for cost- 
sensitive customers. 

The development project for the Models 705 and 710 relied 
on a single design and standard ASIC technology instead of 
custom VLSI to provide high performance, fast turnaround, 
and low cost. This resulted in a 50-MIIz design that was 
completed and released for shipment in just under 12 
months from initial concept. Other factors contributing to 
this short development included the use of high-level hard- 
ware description languages, the realization of gates through 
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synthesis techniques, and the ASK' design and release meth- 
odology employed by Hewlett-Packard. Hardware ASIC 
emulation was also used to supplement the logic design veri- 
fication portion of the development to guarantee the correct 
design of our ASICs. The system board and First -pass ASICs 
were fully operational and running the HP-l"X operating 
system in a networked environment (with X Windows) just 
minutes after deliver,- of the first and only pass of silicon. 

Ac k n o w I e dgin e n is 

The authors would like to tliank the many individuals in- 
volved in making the Models 705 and 71(1 possible. These 
include those who contributed directly to the project, as 
well as those who contributed indirectly through design 
efforts that were leveraged. These leveraged designs in- 
cluded the other Series 7(H) PA-RISC products, as well as the 
Series 40ii Motorola OSK products. 
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Low-Cost Plain-Paper Color Inkjet 
Printing 



The HP DeskWriter C and DeskJet 500C printers are based on advanced 
thermal inkjet technology in the form of a 300-dpi three-color inkjet print 
cartridge. The printers and software drivers that use this cartridge were 
developed on an aggressive one-year schedule. 

by Daniel A. Kearl and Michael S. Ard 



The use of color in written communications has revolution- 
ized the printed-output industry. The ability of vivid colors 
to draw attention and stimulate retention is immense. Eiigh- 
density color monitors anel high-<|iiality color applications 
for personal computers have reset customer expectations. 
However, an affordable, plain-paper color printing solution 
has not been available. 

The introduction of t he HP DeskWriter C printer in the 
summer of 1991, followed by the IIP DeskJet 500C printer, 
set new standards for low-cost color printing. 1 - At the heart 
Of these printing solutions is HP's three-color 300-dpi ther- 
mal inkjet print cartridge. This print can ridge provides the 
printer with the ability to deliver high-quality 300-dpi color 
images and graphics on a wide variety of "plain" ofnce pa- 
pers. This 300-dpi plain-paper color capability represents a 
major increase in price/performance for personal printers, 
it is a good example of a technology-enabled performance 
increase. Pig. I shows the IIP DeskWriter C and DeskJet 
500C printers and Pig. 2 shows examples of their output. 



Color Print Cartridge 

The color print cartridge for the HP DeskJet 5000 and Desk- 
Writer C printers is another extension of HP's thermal inkjet 
printing technology (see "Thermal Inkjet Review...," page 67 
and references 3. 4, and 5). This technology was first made 
available in 1984 with the introduction of t he HPThinkJet 
line of printers. These printers initially used a black print 
cartridge that had 90-dpi resolution and required special 
paper. Later enhancements brought plain-paper and single- 
color printing to the personal printer user. In 1987 HP intro- 
duced the HP PaintJet series of printers. These printers pro- 
vided a fully integrated color printing capability on special 
paper at 180-dpi resolution. The price and performance of 
the IIP PaintJet printers represented a significant advance at 
the time of their introduction. The I IP DeskJet line of print- 
ers was introduced in 1988. These printers offered high- 
quality 300-dpi black printing on a wide variety of office 
paper types. As noted above, the color-capable versions of 
the DeskJet family were introduced in the latter part of 




Fig. 1. HP DeskWriter C and 
DeskJet 500C color thermal Inkjet 
printers. 
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Fig. 2. Examples of HP Desk- 
Writer (* and DeskJet 5II0C 
printed output 



1991, the HP Desk Writer C" printer for Macintosh computers 
and the HP DeskJet 500C for the PC. Print cartridge technol- 
ogy development has played a critical role in the successful 
introduction of each of these generations of thermal inkjet 
printers. 

Reduced to the simplest terms, the HP DeskJet 500CV 
DeskWriter C color print cartridge is a compact, low-cost, 
high-resolution color dot generator. The printer dictates to 
the print cartridge when and where to deliver the color dots. 
This particular cartridge delivers three different colors of 
dots: cyan, yellow and magenta. These colors are known as 
suhtractive primary colors. The size, shape, and optical 
properties of the dots produced should ideally he indepen- 
dent of the media and I he printing environment. These dots 
should be delivered to the media at a very high rale of speed 
with a positional accuracy commensurate with the high res 
olution of the device. All of the dot generator properties 
should remain constant over the storage and printing life of 
the print cartridge. These relatively simple color dot genera- 
tor performance goals represented a major development 

challenge for the HP Inkjet Components Division. 

In outward appearance, the color print cartridge for the 
IIP DeskJet 500C and DeskWriter C printers is very similar 
to the original IIP DeskJet black print cartridge (Fig. :$). The 
plastic body of the print cartridge has been enlarged some- 
what to provide room for the three individual ink reservoirs. 
Electrical interconnection to the printer is accomplished 
using the same flex circuit technology as the black print 
cartridge. The location of the pressure interconnection pads 
is identical to those on the black print cartridge. Provisions 
have been made in the printer drive electronics to sense 
which can ridge has been installed and respond with the 
appropriate drive signals. 

In general, the thermal inkjcl drop generator portion of the 
color print cartridge for the IIP DeskJet 500C and Desk- 
Writer C represents a natural extension of the existing IIP 
DeskJet black print carl ridge technology. Careful inspection 
of the gold-plated nickel nOZZle assembly shows that the 



nozzles have been arranged into three groups or primitives, 
one for each of the suhtractive primary colors (Fig. 4). The 
three primitives all share a common silicon thin-film sub- 
strate and circuitry- This thin-film substrate is fabricated 
using processes very similar to those used for the IIP Desk- 
Jet black and HP PaintJet print cartridges. 1 " 1 The primitives 
are staggered with respect to one another to provide separa- 
lion for the ink delivery channels on the back side of the 
substrate. Each of the primitives consists of two columns of 
eight nozzles each. The nozzles within a given column are 
spaced on 1/150-inch centers. Each column in a given primi- 
tive is offset with respect to its neighbor by 1/300 inch, so 
lhat the combination of the two nozzle columns results in an 
array of 16 nozzles with a vertical resolution of 1/300 inch. 

The major technology contributions in this color print car- 
tridge are in the areas of ink chemistry and manufacturing 
processes. Ink formulation is the key to producing high- 
quality plain-paper color images and graphics. A discussion 
of the nature of plain paper has been presented in an earlier 




Fig. 3. Tin- original HP DeflkWrfter/Desfctei 500 black print car- 
tridge (I) ami the new Ihrep-eulor III' DeskWriter l VDesk.lel 5IMIC 
;intl-ilpi print earl ridge (r). 
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Fig. 4. The nozzles of the lliree-color print cartridge are arranged 
in three groups or prim/lii vs. Each primitive prims one of the 
tliree primary rolors: cyan, magenta, and yellow. 

issue of tliis publication. 1 ' The very substantial plain-paper 
color ink fomiula developmenl effort is the subject of the 
article "Ink and Print Cartidge Development for the IIP 
DeskJet 500C/DeskWriler C Printer Family." which follows 
this article. Several challenges unique to color printing were 
met in this ink developmenl project. 

The additional complexity of three independent ink reser- 
voirs presented substantial challenges in the print cartridge 
manufacturing area as well. These challenges are discussed 
in three articles. The first. "Automated Assembly of the HP 
DeskJet 50(K7DeskWriter C Color Print Cartridge" (page 77) 
provides insight int o several of the problems of high-volume 
manufacturing of this three-color thermal inkjet device. 
Machine vision is extensively used in these manufacturing 
operations, and is the topic of t he article "Machine Vision in 
Color Print Cartridge Production" on page 87. Finally, the 
demanding applications of adhesives technology are dis- 
cussed in the article "Adhesive Material and Equipment 
Selection for the HP DeskJet 500C/DeskWriter C Color Print 
Cartidge" on page 8-1. 

Fast-Track, Leveraged Product Development 

The HP DeskWriter C and DeskJet 500C printer designs 
were leveraged from the successful HP DeskWriter and 
DeskJet 500 printers, both high-quality 300-dpi monochrome 
products. Leveraging from successful products still required 
enormous effort to deliver color printing solutions to market 
in approximately one year. Prioritized and focused program 
execution by all departments and functional areas was criti- 
cal in accomplishing the following major steps in this 
aggressive development program. 

First, there was an immediate focus on key feature-set re- 
quirements and crucial program objectives. The urgent mar- 
ket opportunity drove the schedule as the primary objective. 
Early customer research in the form of extensive phone 
screening and user focus groups provided important data on 
required features, applications, and pricing. Usability studies 
were also conducted to verify the acceptability of a one- 
cartridge color-printing solution in terms of both ease of use 
and throughput performance. With this data, the product 
team identified the critical market "must" features, leaving 
the balance of design opportunities for future product revi- 
sions or new product dev elopments. This research data also 
facilitated trade-offs required late in the development stages 
to meet aggressive schedule objectives. 



Next , there was intense execution of the program objectives 
by I he development staff. Participating on a leading-edge 
product with such apparent potential provided great team 
motivation. At the heart of this team effort was an experi- 
enced core of engineers whose insights and awareness were 
invaluable in making real-time design decisions and trade- 
offs to support a market-driven schedule. The product devel- 
opment teams were organized into two primary groups: 
printer-product and environment -driver developmenl. Printer- 
product development consisted of a mechanical team and an 
electronic and firmware team. Environment-driver develop- 
menl included a Macintosh driver team and a Microsoft 5 
Windows driver team. 

Each development group faced significant design issues. 
The printer-product group focused on supporting the new 
color and monochrome print cartridges. The mechanical 
team focused on physical pen support, including carriage 
mounting, movement, electronic interconnect, service and 
storage of the print cartridges, plus enhanced media flexibil- 
ity ami support The electronic and firmware team pursued 
development of color support algorithms and logic. These 
required key architectural enhancements and additions, plus 
complex host interaction and task partitioning. 

The environment-driver group supported the majority of the 
color science requirements, including intelligent formatting, 
rasterizing, color matching and dithering algorithms for opti- 
mum output results (see "HP DeskWriter C Printer Driver 
Development." page 93). The contribution of the environ- 
ment drivers in providing an extensive, high-quality, color- 
optimized solution was of major significance. An example of 
this is the printers' support of over sixteen million unique 
colors based on 24-bit color data. The drivers also provide 
ways to overcome limitations of the printhead and ink 
technologies. An example is the extension of the environ- 
mental print range by providing user-adjustable intensity or 
ink depletion settings for the output. 

Because of the tight schedule, the development activities 
tended to parallel standard textbook approaches to fast- 
track product development. There was a clear market focus, 
clearly defined program objectives, a firm product defini- 
tion, total site commitment to the development effort, expe- 
rienced development and management teams, process flexi- 
bility, and significant leverage opportunity. Unique to this 
product development were several activities that had signifi- 
cant impact on minimizing schedule and reducing design risk. 

First, the HP DeskWriter C printer for the Macintosh envi- 
ronment was targeted for initial introduction, with the Desk- 
Jet S0OC for the PC market offset by two months. This al- 
lowed initial attention to be focused on the Macintosh 
environment solution. Key developments and insights from 
the Macintosh printer and driver were then leveraged and 
applied to the PC environment printer and the Microsoft 
Windows driver. 

Next, effort was made to ensure that ample engineering re- 
sources were available for this fast-track program, especially 
on the mechanical team. This provided flexibility to move 
development resources where they were most urgently 
needed, when they were needed. Understaffing fast-track 
development projects is a major failure risk. 

(continued on page 38) 
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Thermal Inkjet Review, or How Do Dots Get from the Pen to the Page? 



Orifice 



Firing Chamber 




Silicon Substrate 

Ink Inlet Area 

Fig. I. An exploded cross-sectional view of a single tnkjet nozzle 

The print cartridge for the HP DeskJet 500C/DeskWnter C printers delivers dots to 
the page using thermal inkjet technology The fundamental technology is the same 
as that used in previous Hewlett-Packard inkjet products such as the HP ThmkJet. 
PaintJet, and DeskJet printers. Detailed descriptions of these products 1 and 
more in-depth discussions of thermal inkjet theory and background'- 5 can be 
found in the literature. 

In its simplest form, an inkjet device consists of a tiny resistor aligned directly 
below an exit orifice Ink is allowed to flow into the resistot area, and when the 
resistor is heated, the ink on the resistor essentially boils and forces a tiny droplet 
of ink out of the aligned orifice This is called firingibe nozzle 

A cross-sectional view of a single inkjet nozzle is shown m Fig 1 On the floor of 
the firing chamber is a resistor. This resistor is patterned onto a silicon substrate 
using conventional thin-film fabrication procedures Leads are connected to the 
resistor through the thin-film substrate These leads ultimately 'ravel out to the 
flexible circuit on the body of the print cartridge, through which a voltage can be 
applied across the resistor The resistor is the heart of the thermal inkjet device 
and the size of the resistor is the primary factor governing the volume of the 
ejected droplets. 

The walls of the firing chamber are made up of a photosensitive polymer This 
polymer serves to define the walls of the fifing chamber and determines the spac- 
ing between the resistor surface and the orifice The thickness of this photosensi- 
tive barrier and the dimensions of the firing chamber are critical to the production 
of a well-formed droplet 

The photosensitive polymer also defines the dimensions of the inlet area to the 
firing chamber Ink enters into the firing chamber through this inlet area Like the 
harrier thickness, the inlet dimensions greatly affect the characteristics of the 
ejected droplet 

Finally, a gold-plated nickel orifice plate sits on top of the barrier An orifice is 
formed ffl this plate directly above the hung chambei Tins orifice hole is formed 



using an electroforming process Tne diameter of the orifice has a direct bearing 
on the volume and velocity of the ejected droplets 

To fmj a drop, a voltage pulse is applied across the resistor. This pulse is typically 
very short, on tne order of 2 to 5 microseconds m duration The voltage pulse 
causes the resistor to neat up, temporanry bringing the resistor surface to temper- 
atutes up to 400'C Heat from the resistor causes ink at the resistor surface to 
superheat and form a vapor DuDcfe formation W this vaprr twooie :s a fast ano 
powerful event and expansion o' tne bubble forces some of the ink in bhe fmng 
chamber out of the orifice at velocities of typically 10 meters per second 

By the time a droplet is ejected, tne resistor has cooled down and the vapor 
bubble has collapsed Through capillary forces, more ink flows into tne firing 
chamber through the inlet area, tnus readying the system for the firing of another 
droplet The frequency at which the printhead can repeatedly fire droplets is 
determined by several factors including the inlet dimensions, the barrier thickness, 
and the fluid properties of the H 

The device described above is essentially a droplet generator The device designer 
has a fair amount of control over the characteristics of the ejected droplets. For 
example, the volume of the ejected droplet can be controlled by changing the size 
of the resistor — bigger resistors give droplets of larger volumes In addition, the 
diameter of the orifice can be used to control droplet volumes Droplet velocity is 
also controlled primarily by the diameter of the orifice 

The frequency at which droplets are ejected can be controlled by altering the size 
and shape of the barrier and by changing the Theological properties of the ink. For 
the particular design chosen for the HP DeskJet 500C/DeskWriter C printhead, and 
for the ink fluid characteristics, the typical operating frequency of the printhead is 
about 3 kHz. 

Droplet characteristics, as they relate to print quality on the media suriace. can be 
optimized through careful control of orifice profiles and resistor/orifice alignment 
Ink, which also has a dominant effect on print quality, is discussed in more detail 
in the article on page 69 Ink properties such as surface tension, viscosity, and 
thermal stability all play important roles in the production of useful droplets. 

In real-life inkjet printheads, multiple nozzles are lined up on a single silicon sub- 
strate/orifice plate assembly to form an array of droplet generators. Fig. 4 on page 
66 shows the layout of nozzles for the HP DeskJet 500C/DeskWnter C print car- 
Iridge These nozzles can be fired in rapid succession as the printhead is scanned 
across a sheet of paper or other media The firing of these multiple nozzles at high 
frequencies under control of a microprocessor can ptoduce high-resolution, high- 
quality text and color graphics at scan rates of 1 0 to 20 inches per second. Thus 
we see that thermal inkjet technology offers a compact, highly tunable method for 
delivering droplets from the pen to the page 
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A substantial and thorough testing effort was put in place 
early in the program. Specific and tailored test processes 
covered both the printer-product and the environment-driver 
developments. This investment allowed early identification 
of design defects and concerns. Where changes were re- 
quired, early identification enabled easier and more cost- 
effective solutions while minimizing schedule impact. 

Aggressive schedules were broken into short development - 
based milestones. This allowed close monitoring of prog- 
ress, and kept all developmental activities visible. Issues and 
concerns were monitored at the program level on a weekly 
basis to ensure that problems were addressed and resolved 
expedit iously. 

Kinally, frequent prototyping throughout the development 
cycle allowed immediate evaluation of design solutions. 
Prototype builds in large numbers provided printer units 
to facilitate all cross-functional development, testing, and 
extended support activities. To meet fast -track, aggressive 
schedules, .an organization must be willing to spend money — 
especially on prototypes in sufficient quantities to meet a 
large demand. Outstanding suppoii by manufacturing and 
the model shop In this regard was crucial. Similarly, early 
availability of print can ridges in large quantities was essen- 
tial to keeping fast -paced development activities on sched- 
ule throughout the design, implementation, and test cycles. 
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Ink and Print Cartridge Development 
for the HP DeskJet 500C/DeskWriter 
C Printer Family 

A new tnchamber print cartridge allows the low-cost HP DeskJet printer 
platform to print in color. The ink vehicle, dyes, dye concentrations, and 
interactions had to be carefully traded off to optimize performance with 
respect to color bleed, color saturation, composite black production, edge 
acuity, drying time, and resistance to crusting. 

by C raig Maze. Loren E. Johnson. Daniel A. Kearl, and James P. Shields 



Development of the print cartridge for the IIP DeskJet 500C* 
and DeskWriter (' primers required a continuation of ink 
chemists, print cartridge architects, and design engineers, 
along with inputs front the fields of color science and 
product marketing. This article deals with the design and 
development of the inks and the print cartridge. 

The major Objective of the printer development project was 
to provide a low-cost desktop printer that produces laser- 
quality black print and also offers color capability to the 
user. To enable the low-cost HP DeskJet printer platform to 
print in color, a Irichamher color print cartridge was de- 
signed thai can he exchanged for the black print cartridge 
when color printing is desired. 

Incorporating 8 trichamber color print cartridge into the 
existing IIP DeskJet print platform posed several challenges 
for ink chemistry and print cartridge design. Like the ink for 
the black-only IIP DeskJet printer family, the color inks had 
to work on a wide variety of "plain" papers. Plain paper 
priming using thermal inkjei technology has been reviewed. 1 

I tne Important consideration is ink drying time. In addition. 

when inks of different colors are laid down next to one 
another on the paper, they must not bleed or diffuse into 
adjacent regions of different color, a fault commonly re- 
ferred to as color bleed. A significant level of c olor bleed is 
undesirable and causes the border between the two colors 
tO appear ragged and undefined. 

Another challenge involved the production of composite 
black. This term refers to the production of black using the 
three primary colors (cyan, yellow, and magenta) from the 
trichamber prim cartridge. Composite black results from 
printing dots Of cyan, yellow, and magenta directly on top of 
each Other. This requirement exists because only the black 
print cartridge or the color print cartridge can reside in the 
primer stall at any given lime. Thus, when the color car- 
tridge is installed, any black print must be produced using 
composite black. As discussed Inter, this required some 

tradeoffs between the composite black color characterise 
lies, the color quality of the primary anil secondary colors, 
and ink robustness. 



Finally, the greatest challenge was perhaps in the develop- 
ment of an ink that performed well on a wide variety of plain 
papers. These paper types range from high-quality cotton 
bond papers to the papers used in high-speed copiers. Find- 
ing an ink i hat met all of the challenges involved delicate 
trade-offs among objectives that were often in conflict. 

Evolution of the Ink Vehicle 

Water-based color inks have evolved from formulations de- 
signed to print on special paper to those capable of printing 
on plain paper. Many hundreds of different mixtures were 

tried before obtaining satisfactory performance over a large 

variety of papers. In general, inks changed from those con- 
taining large amounts of diethylene glycol, about till 1 !'", to 
those containing different organic solvents at greatly lowered 
concentrations. 

Some of the requirements on the ink are in conflict For ex- 
ample, drying must be slow in the print cartridge to prevent 
plugging of orifices and the associated loss of print quality, 
hut rapid on paper to facilitate paper handling and maintain 
printing speed. Color bleed, if loo great, will ruin a print 
sample (see Fig. 1). Ii is also one of the mosl difficult param- 
eters to control by ink formulation alone Composite black 




Fig. i. < lolor bleed is iin- undesirable mixing of two colors printed 
next to each other, (a) Unacceptable color bleed! d>i Minimal 
color bleed 
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text print quality must be high, which means a dark compos- 
ite black and high edge acuity. Unfortunately, ink formulas 
that give improved text print quality tend to have increased 
amounts of color bleed. 

It is difficult to meet all of these requirements, and some 
compromise is unavoidable. Market survey? were used to 
separate performance factors I hat are important to our cus- 
tomers from those that matter least. Samples were printed 
with known defects on different papers over a range of 
temperature and humidity conditions and with different 
inks. These were submitted to potential customers for their 
evaluation and the results were used to guide ink develop- 
ment. For example, high color saturation was a strong cus- 
tomer requiren tent, and ink component concern rat ions were 
adjusted accordingly. 

The first major challenge was to provide a formulation that 
would not bleed and would deliver saturated colors. Bleed 
was suppressed by adding surface-active reagents to speed 
penetration of the ink into the paper. This reduces the lime 
available for dyes to diffuse on the paper and for colore to 
mix. Unfortunately, penetration into the paper reduces color 
saturation, so allowances were made to keep color saturation 
high. 

High-quality composite black text print requires a dark black 
with high character edge definition. The former is achieved 
by keeping the dye on the surface of the paper, and the latter 
by minimizing penetration of the ink into the paper. These 
requirements can be at variance with fact ore that control 
bleed. 

It became apparent that to deal with the various trade-offs 
In an efficient manner a different experimental approach 
was needed. After some exploratory work, these Irade-offs 
were deall with all at once by conducting a series of designed 
mixture experiments. Optimum component concentrations 
were obtained that achieved acceptable bleed control, color 
gamut, edge acuity, and color saturation. In general, solvent 
concentration was kept low, surface tension was reduced, 
and viscosity behavior modified. 

Dye Selection Criteria 

In selecting and using the dyes for the HP DeskJet 500C7 
Desk Writer C color print cartridge, four primary al tributes 
were considered. Any dye to be considered for use had to 
address all four of the following concerns. 

The first performance parameter is resistance to crusting of 
the ink (nozzle clogging upon loss of vehicle). This parame- 
ter was tested by allowing print cartridges lo sit uncapped 
for various lengths of time and then firing them and looking 
for lost nozzles. 

Several characteristics of a dye determine how resistant it 
will be to crusting. The purity of the dye plays a very impor- 
tant role. The presence of any diluents, reaction by-products, 
secondary colore, or unreacted feedstocks can degrade 
crusting performance. As a consequence, some initial effort 
was expended in cleaning up the dyes that looked promising 
so that crusting performance could be reliably evaluated. 

The structure of the dye also plays a very important role in 
the crusting performance of an ink. As a nozzle sits un- 
capped, exposed to the air. the more volatile components of 



the ink evaporate. The solubility of the dye in the resultant 
evaporated ink at the nozzle influences crusting perfor- 
mance. This solubility is determined in pan by the structure 
of the dye and its counterions* Dyes I hat exhibited enisling 
problems because of their structure were eliminated from 
further consideration. 

The eounteiion used lo balance I he charge on the dye mole- 
cule also effects the enisling performance of the dye. In many 
instances this eounteiion can be changed to improve enisl- 
ing performance. For dyes lhal showed good performance in 
other respects, the counterion was altered to maximize 
enisling performance. 

A property of thermal inkjei performance called "kogation" 
(from the Japanese kogasu, to scorch) is the second primary 
attribute thai is important to dye selection. Kogation occurs 
when the ink. which contains the dye, is exposed lo the high 
temperatures generated on I he printhead resistor surface 
during operation. These high temperatures sometimes break 
down the dye inlo insoluble fragments thai slick lo the resis- 
tor surface. Alternatively, the dye itself can be bound to the 
surface. The buildup of this carbonaceous layer on the resis- 
tor reduces heat transfer to the ink. In severe cases the 
print cartridge will no longer fire because of the insulating 
character of the kogation. 

Dyes are tested for kogation performance by tiring inks con- 
taining the dyes to be tested from a print cartridge. As the 
print cartridge is tired the volume of the ejected drops is 
monitored. The presence of kogation lends to reduce the 
drop volume over Ihe life of the print cartridge, resulting in 
degraded prinl performance. 

Structure and eounteiion play a role in dye kogation. Dyes 
thai exhibited poor kogation characteristics because of their 
structure were discarded. In some cases, kogation can be 
improved by altering the counterion attached to the dye. 

The third primary attribute used for dye selection is the 
color characteristics of the dye. Hue and chroma are param- 
eters of utmost importance to a color printer. Dyes were 
chosen to hav e hues within a given range so that a balanced 
palette could be generated. Dyes were selected to have the 
highest chroma characteristics possible. 

Finally, given lhal Ihe dyes must function in a Irichambered 
print cartridge, where the same nozzle plate is shared by the 
three different primary colors, dye-to-dye interactions had 
to be considered. A dye must not read adversely with any 
other dye used in ihe ink set. Examples of such reactions 
are precipitation and hue change. 

Color Balancing and Dye Concentration Adjustment 

Once a set of dyes was selected using the above criteria. Ihe 
concentrations were adjusted to give good secondary and 
primary hues and to achieve maximum saturation. These two 
goals had lo be achieved without sacrificing the robustness 
of ihe print can ridge performance. 

(continued on page 73i 

Solutions must De electrically neutral A counterion 15 an inn mat balances out the charge 0' 
another ion In solution, that is. its charge is the opposite of that of the ion ,n question Thus, 
for e»ample. a negatively chargeo dye molecule might have a positively charged sodium ion as 
its counterion 
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Color Science in Three-Color Inkjet Print Cartridge Development 



During me development of me three-chamoer color print cartridge mere was a 
need to describe and measure We colors that were printed There are a number 
of different wars to deserve colors The purpose of this article is to explain 
how colors are generated with the color print cartridge, what color means, now 



How the Color Print Cartridge Prints Colors 

The print cartridge has three different inks cyan, yellow, and magenta ICVMI 
Eight colors can be generated at 300-dpi (dots per mchl resolution py printing 
different combinations of the primary inks The eight colors are printed as follows- 

• Not printing any ink gives "wnite." the paper color 

• Printing one of the primaries gives cyan (a greenish bluel. yeliow. or magenta (a 
bluish red) 

• Printing a combination of two primaries gives the secondary colors, which are red 
(a mix of yellow and magenta), green (a mix of cyan and yellow), and blue la mix of 
cyan and magental 

• Printing all three pnmaries gives the tertiary color black 

Note that the various colors are defined bv the printing process, so the names 
given to them here Ired, olue. etc ) are not necessarily accurate 

More Colors with Halftoning 

To obtain a larger number of colors using the print cartndge. the ratio of the CYM 
proportions must be changed This is not possible at 300 dpi because the HP 
DeskJet 500C is a binary printer. Halftoning algorithms are used to increase the 
number of printable colors Each picture element is composed of a number of dots 
of mk (pixels) In a simple case, each picture element is composed of 4 pixels in a 
2-by-2 halftone cell (sometimes called a superpixell This yields an effective reso- 
lution of 150 dpi Process yellow is now 4 drops of yellow ink m the 4 pixels, 1 
drop per pixel Process red is 4 drops of yellow and 4 drops of magenta, 1 drop ol 
each ink in each pixel By reducing the number of magenta drops to 3. 2. or 1 , 
three more colors can be produced between yellow and red This process can be 
extended to larger halftone cells lo increase the number o( available colors. There 
are many different halftone algorithms which yield different colors and textures 
but all have the ability to increase the number of colors at the expense of resolu- 
tion For all these print methods, any color available to the HP DeskJet 500C 
printer can be produced by specifying how much of the CYM inks to print and 
where to place the inks This leads to the concept of a process color space 

Process Color Space 

As stated earlier, any color achievable with the color print cartridge can be de- 
scribed III terms o' the amounts of CYM being printed for a particular halftone cell 
This is an efficient color space for the printer but is not useful for describing the 
printed color The problem with process spaces. In general, is that they are system 
dependent All printers have the same process color space — the proportions of 
CYM being printed Unfortunately, the color printed is not constant, because 
different printers have different CYM inks This was apparent in developing the 
inks for the HP DeskJet 500C print cartridge Different dyes and vehicles were 
formulated, all yielding different colors upon printing A better description of color 
other than the process space was needed 

Visually Based Color Spaces 

Visually based color spaces or color order systems are created by empirically 
arranging a set of colors according to the observed relationships between adja- 
cent colors. The criteria describing the relationship can be varied, yielding differ- 
ent color systems Color order systems are all three-dimensional, independent of 
the sorting criteria These spaces are very useful because the colors are "uniformly" 
spaced and based on human perception A unique value can be placed on any 
color by visually matching the sample color to an example from a standard color 
order system. 

A widely used color order system is the Munsell notation Colors are described by 

• Value, a measure of darkness anil lightness 

• Hue, the shade of the color 

• Chroma, the "brilliance" uf the color 



This three-dimensional color space has cylindrical coordinates value is the i axis 
chroma is the radial distance, and hue is me angle It is a satisfying color coordi- 
nate system for describing colors because n arose from an empirical sorting of 
colors into a standard set The Munsell color soace is a uniform color space but 
nas problems m oav-to-dav use To use mis color order system a visual compari- 
son oetween the sample color and the standards has to be made Tms is prone tc 
e«cs and is iime-tonsuming. and the exact standard required must oe aw'fa&le 

Measurement-Based Color Systems 

Tne optimal system would be a measurement system mat yields values Wat 
correlate with a color order system This nas been achieved using the CIE color 
measuring system 

For a particular observer and illummant. colors can be uniquely described by three 
coordinates Therefore, to measure colors. three values need to be obtained A 
one-to-one mapping between me three measurement values ana a visually based 
color space must exist for the measurements to be truly useful 

Tnree components are necessary for color measurement. 

• A light that illuminates the sample 

• A sample that has a characteristic ref lectance spectrum mat changes the relative 
spectral distribution of the light source 

• An observer that detects the reflected light. 

Common standard light sources are D65. a representation of daylight. A, repre- 
senting incandescent light, and F2 or CWF, representing "cool white fluorescent" 
lights. Standard detectors include the CIE 2-degree and 10-degree oaservers. At 
HP we have typically used the DB5 source and the 10-degree observer This source 
and observer have been selected and standardized to give me CIE system of color 
measurement 

The CIE system applied to a reflectance spectrum yields X. Y. and Z tristimulus 
values. These values can be used to specify any color for a given observer and 
source Unfortunately, the mapping of the XYZ space onto a visually based color 
space such as the Munsell space is very nonlinear Many transformations of the 
XYZ space were investigated to yield a uniform measurement space. A common 
transformation is the CIELAB color space The coordinates of this space, called L' 
a', and b', map Quite well into the Munsell space except that a rectangular coor- 
dinate system is used to describe the colors rather than a cylindrical coordinate 
system, as shown in Fig 1 

Color Measurement 

Once the source and the observer are specified, the reflectance spectrum of the 
sample determines the measured color In developing the color print cartridge, the 
rellectance spectrum was considered to be the only variable under the control of 
the printer The ink. print cartridge, media, and print modes had to be developed 
to deliver the appropriate reflectance spectra and therefore colors to meet the 
market needs. 

Color measurement can be done with two types of instruments either a colori- 
meter or a color spectrophotometer A colorimeter is designed to measure the XYZ 
tristimulus values directly, relative to a specified llluminani and observer A spec- 
trophotometer measures the reflectance spectrum such that the XYZ tristimulus 
values for any combination Df source and observer can be subsequently calculated 
A colorimeter is useful for relative measurements while a spectrophotometer is 
designed more for absolute measurements. The flexibility of the spectrophotometer 
comes at an additional cost, as might be expected 

Further Manipulations of CIELAB 

CIELAB is a rectangular coordinate system in which L" is the lightness color coor- 
dinate, a* is the red-green coordinate, and b" is the yellow-blue coordinate 

It is often useful to monitor the color of a sample using hue and chroma metrics 
because these yield information similar to the Munsell system In the CIELAB 
system, hue (hi can be defined as the arctangent of the a* and b" coordinates, 
and chroma IC) is the radial distance from the L' axis as represented In Fig 1 
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Fig. 1. 3D representation of the CIRAB space showing prelections anlu the a'ti* and 
C"h planes 

h = arctan(a*/tj*) 

C" = > a* 2 + b' 2 

Hue anfl chroma derived from the CIELAB space ate not the same as the Munsell 
hue and chroma 

Another quantity derivable from the CIELAB values is color "saturation." which 
gives a sense of the depth of color: 

S* = C7L*. 

where S' is saturation and C and L* are defined above 

The difference between two colors can be easily described using the CIELAB 
system Since each color is described by three coordinates, the color difference 
can be given by the vector length between the two colors, as shown in Fig. 2 A 
color match is obtained when the vector difference is zero, or the two colors have 
the same L*. a", and b' values The vector difference between the two colors is 
called AE* and is given by the following equation. 



AE' = vAL' 2 + Aa' 2 + Ab* z , 

where AL* is the difference in L" values. Aa" is the difference in a" values, and 
Ab* is the difference in b* values between the two color samples 

Other color difference metrics can he defined, such as AC*, the difference in 
chroma Care needs id be taken with AE' because the critical AE* for a color 
change to be noticeable depends on the color of the sample Yellows are forgiving 
while reds are not 

The selection of the appropriate color difference metric depends on the task for 
which n is being used AE* is a scalar quantity it gives the size of the color differ- 
ence but not the direction Breaking AE" into its components often yields mote 
useful data because the direction and mam component of the color difference can 
be obtained 



Uses of Color Measurement 

Process Control. During me development of the color print cartridge it was 
important to maintain constant colors Variations in the color output over lime can 
be caused by many variables Measuring the color printed with a print cartridge 
gave some measure of process control Other methods of process control could 
have been implemented but colonmetnc data is particularly useful because the 
process limits are related to human perception 

Dye and Ink Selection. Both the dye and the ink vehicle affect the colors gener- 
ated with the color print cartridge Given target colors, inks can be formulated to 
match the targets Culorimetnc data is particularly useful here, especially CIE hue 
The hue values of the secondaries are very important and depend strongly on tfie 
dye selection For a given set of dyes, saturation was also found to be a useful 
metric 

Grey Balancing. Because the HP DeskJet 500C is a one-print cartridge color 
printer, black must be printed with a mixture of the CYfvl primaries. The best black 
was not obtained with a tertiary black but by using a different ratio of C:Y:M Color 
measurement was used to optimize the ratio of cyan, yellow, and magenta to print 
when black was requested This had two benefits The first is that the Quality of 
the black print is improved because the best possible black is printed The second 
is that less ink is needed to punt this black, so the bleed performance of the printer 
is improved 

WYSIWYG Printing, Jusi as measurement spaces can be mapped onto visually 
based spaces, process spaces can also be mapped onto any color space. This 
makes it possible to print device independent colors and duplicate the desired 
colors For example, CRT monitor colors are described by their own process space. 
RGB There is a one-to-one mapping, very dependent on the particular monitor, 
between RGB and CIELAB There is also a one-to-one mapping between CIELAB 
and the printer CYM space. By knowing both maps the colors shown on the moni- 
tor can be accurately duplicated on the printer, yielding WYSIWYG printing (What 
You See Is What You Get) The success of this process depends on the character- 
ization of the mapping from the printer CYM space to the CIELAB space This 
involves extensive color measurement and characterization of the printing system 
using a spectrophotometer Problems do occur with WYSIWYG printing when 
either the printer or the monitor cannot print or display the requested colors 
Sophisticated image manipulation has to occur so that a successful mapping can 
be accomplished 

John M. Skene 
Development Engineer 
Inkjet Components Division 




Fig. 2, Color difference metrics 
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To guarantee prim ran ridge performance, the condition of 
the ink over the life of the print cartridge was determined. 
Even in an enclosed system such as the HP DeskJet prim 
cartridge, there is vapor loss over time through the plastic 
walls. This vapor loss alters the ink vehicle composition and 
can ultimately affect the crusting performance. Once the 
rate of litis loss was determined, inks were made up having 
vehicle compositions that a print cartridge would have ai 
the end of its rated life. Dyes at various concentrations were 
placed in these vehicles and the crusting was tested The 
maximum dye concentration that could pass the crusting 
test was established as the temporary maximum for that 
particular dye. 

Next, a set of inks was made having a range of dye con- 
centrations, from low up to the previously established tem- 
porary maximum. Print samples were made with these inks 
and the L*a*b* coordinates of the samples w ere measured 
(see "Color Science in Three-Color Inkjet Print Cartridge 
Development." page 71). If a dye showed a rebound point 
(the dye concentration al which color performance decreases 
because of overloading), the rebound point was set as the 
new temporary maximum. These temporary maximum dye 
c oncentrations were then corrected for the effects of vapor 
loss so that the ink performance would be maintained 
throughout the projected print cartridge life. 

The next step in the process of color balancing was to deter- 
mine what colors were most important in the color DeskJol 
primers. The relative order of desirability of various reds, 
blues, and greens for secondary colors was established by 
market studies. The ratios of the different dyes needed to 
produce the optimum red. blue, and green secondaries were 
then determined by trial and error. 

For example, the concentration ratio of magenta to yellow 
Ihat produces an optimal red might be 1.8:0.7. The optimal 
ratios for the blue and green secondaries of the dye set 
would be different. In most cases the ratios didn't match 
exac tly. For instance, you could use the magenta-to-yellow 
ratio lo gel a good red. Then you c ould use that magenta 
concentration with the optimal ratio of magenta lo cyan to 
get a good blue. However, the resulting cyan and yellow con- 
centrations would then not be optimal lor green. Conse- 
quently, optimizing the overall color palette required that 
some tradeoffs be made. 

In making the Irade-offs and arriving al the final dye Con- 
centrations for use in the ink. an anchor dye and concentra- 
tion were c hosen. This anchor dye was chosen by evaluating 
the effect of dye concentration on chroma. The dye whose 
chroma level was most sensitive to dye concentration was 
chosen as the anchor dye. The concent rat ion of this dye was 
determined by using the temporary maximum ratios ob- 
tained earlier in conjunction with Ihe optimal ratios needed 
lo achieve good secondaries. 

For instance, say thai yellow is Chosen as the anchor color 
and that the maximum Concentrations for magenta, cyan, 
and yellow are r."i.. and I"", respectively. In addition, say 
that the optimum ratio of magenta lo yellow for achieving a 
gOOd red is 2.">:1 and the optimum ratio of yellow lo cyan for 
achieving a good green is 1.5:1. The new concentration for 



yellow is then calculated by dividing the maximum con- 
centrations of the added colors, cyan for instance, by the 
optimal mix ratios. In this example, this yields a yellow con- 
centration of 2.0% in the case of the red secondary and a 
concentration of 2.7% in the case of the green secondary 
The smallest of the two secondary concentrations is then 
chosen as the maximum concentration of the anchor color, 
w hich in this case is 2% yellow for the red secondary. 

Once optimal concentrations were found for ihe other two 
primaries in relation to the anchor dye concentration, prim 
samples were made using inks with these dye concentra- 
tions. Based upon a visual evaluation of the print samples, 
the concentrations of the two dependent primaries were 
varied to make various trade-offs in the secondary c olor 
space. 

For instance, it may be acceptable lo alter the hue of the red 
slightly to achieve a markedly better blue. This final stage of 
color balancing involves repeated trials and fine tuning of 
the two dependent primaries within the bounds established 
by the temporary maximum Concentrations. The process of 
printing samples, evaluating the color space, and then mak- 
ing concentration adjustments was repealed to arrive at the 
most acceptable color space- 
Matching the Ink and Printhead 

Thermal inkjet ink. printhead, and printer design are all ex- 
cellent examples of highly interactive technology develop- 
ment. Dev elopment progress cannot be made on an inkjet 
ink formulation without at the same time having an appro- 
priate printhead and printer in which lo test Ihe formulation. 
Likewise, printhead architecture and printer development 
efforts are very closely coupled to the performance charac- 
teristics of the ink formulation Progress cannot be made in 
any one of these three arenas without the aid and support of 
the other two. This very interactive development environ- 
ment presents a variety of technological and organizational 
challenges. In this environment, a carefully constructed ink 
screening Strategy was required for timely printer system 

development, 

The ink screening strategy that was used for the color HP 
DeskJet 5()()( '/Desk Writer C print system development was 
based on rapid iterations of ink formulation, print cartridge 
architecture, and printer test beds. F.mpirieal performance 
testing was conducted on do/ens of ink formulations over a 
period of a few short months to determine the final ink com- 
position A progressively wider Variety of tests were run on 
each ink formulation and printhead architecture combina- 
tion. Once a given combination had passed quick, easy-to- 
complete initial testing, ii was progressively subjected to 
tests requiring greater time and resource investments. The 
intended outcome of this effort w as a printing system that 
provided outstanding plain paper color printing performance 
over a wide range of media and printing environments. 

For a given primer resolution. Ihe interac tion of ink. print 
modes, and print media determines Ihe final print quality. As 
discussed earlier, this interaction is largely controlled by Ihe 
nature of Ihe ink vehicle. For a given droplet volume, differ- 
ent ink vehicles will produce differing dot sizes on ihe vari- 
ous types of media. Ideally, one would like absolute control 
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of dot size and placement Given an attractive ink formula- 
tion, variations in the printhead architecture are used to 
control dot size and placement (see "Thermal Inkjet 
Review....'* page 67), 

To produce optimal prim quality, the ink and the delivered 
dr6p volume need to be carefully matc hed. If the drop vol- 
ume is too low, paper coverage will be incomplete. This re- 
sults in desaturation of colors and irregular area fill. When 
the drop volume is too high, resolution and edge acuity suf- 
fer, and adjacent colors have a greater tendency to bleed 
into one another. Local printhead architecture largely deter- 
mines the delivered drop volume. The surface area of the 
thin-film drive resistor and the exit diameter of the nozzle 
bore are the most significant factors in determining ejected 
drop volume. Ink viscosity, surface tension, and the size and 
shape of the photosensitive barrier film that surrounds the 
thin-film resistor largely determine the frequency response 
of the thermal inkjet device. 2 These are the main variables 
used by the printhead designer to optimize print quality 
and speed. These variables were tailored for each of the ink 
formulations tested. 

Printhead Performance Tests 

Candidate ink formulations were subjected to a wide range 
of performance tests. This regime consisted of both printing 
and print cartridge reliability tests. The central theme behind 
all of I his effort was to test the print systems ability to deliver 
outstanding plain paper color print quality consistently over 
the life of the printhead. 

Thfi printing tests were conducted over a large sample of 
different media types. An international selection of "plain" 
papers was included in the sampling. In addition, printing 
performance was measured on HP IJC JetSeries transparency 
film and IIP CX JetSeries special paper. These special media 
products have been developed by HP to provide maximum 
thermal inkjet print quality. In addition to printing on a 
large number of different media, testing was conducted us- 
ing different settings of the printer driver intensity control 
(see article, page 93). Environmental conditions for printing 
were systematically varied to cover the printer operating 
environment (5°C to 40°C, 10% to 80% relative humidity). 

Several specific print quality attributes were examined dur- 
ing the course of the performance tests. These attributes 
include color bleed, color saturation, composite black text 
quality, and "wait banding" performance (defined later). 

As discussed earlier, color bleed is defined as the undesu- 
able intermixing of two different colors when printed im- 
mediately adjacent to each another (Fig. 1). This intermixing 
results in an irregular deviation from the intended color in- 
terface profile. It is most visible when the t wo colors are of 
high relative contrast. Yellow and red or yellow and black 
are the color pairs most often tested for color bleed. Color 
bleed was quantified by visual comparisons against a set of 
standards. These standards were prepared such that varying 
degrees of bleed were produced. A bleed index number was 
(hen assigned to each of the standards. In addition to being 
a strong function of ink formulation and delivered drop vol- 
ume, color bleed is also closely coupled to the dot density, 
media, and prim ing environment. It is aggravated by high 
temperature and high humidity. A significant portion of I he 



color bleed testing was completed at temperatures of 35 = C 
to 40 6 C at 80% relative humidity. 

Color saturation was quantified using either spectropholom- 
eters or colorimeters (see "Color Science in Three-Color Ink- 
jet Print Cartridge Development." page 71 ). It is defined as: 

S*= C*/L*= v a*- + b* 2 /L* 

where: C* = measured color chroma 

a* = red-green color coordinate 
b* = yellow-blue color coordinate 
L* = lightness color coordinate. 

The major factors affecting color saturalion include colorant 
concentration, vehicle formulation, drop volume, print mode, 
media, and printing env ironment. A cold, dry printing envi- 
ronment was found to reduce color saturation. Testing for 
color saturation was completed at 8*€ to 10°C and 10% to 
15% relative humidity. 

The creation of high-quality black text characters and area 
fills using composite black is a particularly demanding task. 
As was the case for color bleed, internally generated com- 
parative standards were used to quantify composite black 
performance. Ink vehicle formulation, color balance, and 
printing media arc the factors most strongly influencing 
composite black text and area fill quality. 

Waitbandiiifi is a rather interesting printing artifact associ- 
ated with the nonsimultaneous raster printing of colors, The 
nozzle arrangement in the color printhead is such that the 
different colors are printed serially onto the print media 
(Fig. 2). Cyan is the firsl ink to be delivered to the page, fol- 
lowed by magenta, then yellow. Nozzles for the three colors 
are spatially separated on the printhead by four dot rows 
(4/300 inch) in the paper advance direction. This results in a 
l ime separation of successive color ink delivery on the order 



Cyan Nozzles 




Fig. 2. The nuzzle arrangement in the color printhead is designed 
to deliver the Iliree colors serially In the media, cyan oeing the 
first. Sixteen nozzles are provided for eacli of the three colors. 
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of 1 to 2 seconds, depending on print mode and scan width. 
During the time interval between delivery' of one color and 
delivery of the next there is a significant amount of media 
penetration and evaporation of the ink vehicle. When mixed, 
or secondary colors ( red, green, blue, and composite black) 
are formed, two or more of the primary colors are delivered 
to a given local area of the print media 

Waitbanding can occur if the time interval between succes- 
sive raster scans during printing is significantly varied when 
creating secondary colors. When this time interval is varied, 
differing amounts of ink penetration and evaporation take 
place before the next primary color is printed onto the first 
primary color. The result can l>e the creation of a horizontal 
band of varied lightness and chroma. This variation in 
color appearance was quantified using the following color 
difference relationship: 

AE = v AL*" + Aa*- + Ab* 2 . 

where AL*. Aa*, and Ab* are the differences in the color 
coordinates in the region of the waitband relative to an 
adjacent region outside the waitband. 

The control of the time interval between scans involves a 
rather complex interplay between the user's computer hard- 
ware, the application software, the operating system, the 
communication network to the printer (if used ), and the 
printer processor and firmware. Several steps have been 
taken in the development of the color HP DeskJet 500C/ 
Desk Writer C system to minimize the variability in the time 
interval between successive scans and its impact on wait- 
banding. Ink formulation also can play a major role in mini- 
mizing the magnitude of the waitband color difference. It is 
for this reason that waitbanding testing was included as one 
of the screening parameters in the ink formulation selection 
regime. 

Several print cartridge performance attributes not directly 
associated with a print sample were measured as well. In 
addition to safely and regulatory considerations (see page 
76), the selection of ink components is also often limited by 
compatibility with lite oilier materials in the prinihead and 
printer system. These system-level limitations are typically 
determined using elevated temperature materials compati- 
bilily and prinihead lesls. In I heir simplest form, these tests 
consist of immersing a given prinihead or printer compo- 
nent in the ink or ink constituent of interest. The immersed 
component is then subjected to elevated temperatures for a 
period of several weeks. Degradation of I he component and 
Changes in the ink or ink constituent are then measured at 
the end of this period. Arrhenius modeling 3 of degradation 
reaction rates is often used to assess the impact on print 
system longevity and reliability. 

Reducing Ihe tendency for the very small prinihead nozzles 
to clog or crust over presenls an ongoing challenge for the 
ink chemist and product designers. A Ink formulation testing 
for Ihe time to the first misdirected droplet was conducted 
throughout the ink development effort. The major prinihead 
design factors affecting crusting behavior include colorant 
species and concentration, vehicle solvents, and the envi- 
ronment in which the prinihead is being operated. The print - 
it designers can contribute to the prevention of clogging by 



providing special print cartridge servicing functions. These 
functions include tight capping of the print cartridge s ori- 
fice plate when not in use. plus periodic wiping and firing of 
the nozzles before and during printing. 

An additional area of ink formulation screening is kogation 
testing. As discussed earlier, kogation is defined as the 
plaque-like buildup found on the thin-film resistor surfaces 
of thermal inkjet devices after many firings of the print- 
head. 4 ^ Kogation testing consists of monitoring the drop 
volume or color saturation a given prinihead produces over 
the full lifetime of the head. Careful selection of ink constit- 
uents and control over the thin-film resistor drive energy are 
needed to reduce or eliminate kogation problems. 

Ink Quality Control 

The inks for the IIP DeskJet 500C and DeskWriter C printers 
were carefully formulated to provide outstanding perfor- 
mance on a wide variety of plain papers. Careful control 
of ink component concentrations and purities during ink 
production is critical to ensure that customers receive 
consistently good performance from their print cartridges. 

The consequences of ink impiuities or incorrect ink compo- 
nent concentrations can be quite severe. For instance, incor- 
rect dye loads and impurities in the dyes can cause signifi- 
cant shifts in Ihe hues thai are produced by Ihe printer. 
Incorrect pH values can cause chemical instabilities in the 
ink as well as undesirable interactions with some of the 
prim cartridge materials. Incorrect ink viscosity can drasti- 
cally alter the firing characteristics of the print cartridge, 
resulting in poorly formed droplets and poor print quality. 
Improper concentrations of certain vehicle components can 
lead to a loss of bleed control between different colors. Im- 
purities in the dyes or other vehicle components can cause 
nozzle plugging and reduce bleed control and print quality. 

The physical ink parameters that are monitored include: 
static SUr&Ce tension, viscosity, pll. solution absorbance, 
and conductivity. A Hewlett-Packard 581KJA gas chromato- 
graph ( GC ) wit h an IIP 9000 Series 300 worksiat ion is used 
to determine the concentrations of sonic of Ihe components 
in Ihe ink. In addition, Ihe GC is used to verify Ihe purity of 
one of Ihe ink constituents before ii is added to the ink. 
Dyes are purified using a variety of procedures including 
reverse osmosis. Selected ion levels are monitored using 
atolnic absorption spectrometry. 

Not all of the components in the ink are determined by a 
specific analytical method. Certain practical compromises 
were necessary. The concentration of some components is 
very difficult lo determine in the ink matrix or requires very 
specialized and expensive analytical equipment thai is not 
COSt-effective in a production environment For these com- 
ponents, indited methods are used to verify proper concen- 
trations, along With the very careful use of process control 
procedures such as weight logs. 

Delivery to the customer of consistent, high-quality output 
relies on many factors, one of which is a consistent ink com- 
position. By carefully controlling ink component concentra- 
tions and purities, delivery of this consistent performance to 
ihe customer is ensured. 
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Making HP Print Cartridges Safe for Consumers Around the World 



The ability to provide assurance that our products do not cause harm to consumers 
and the environment is central to Hewlett-Packard's thermal Inkjet product stew- 
ardship Ink safety evaluation is the most important part of the overall safety and 
regulatory process. 

Thermal inkjet print cartridges contain liquid inks designed to deliver laser-pnnter- 
like print quality These liquid inks contain dyes, solvents, and special additives, 
which are essential to the operation of the print cartridges. Chemists spend consiri 
erable time developing these inks and often synthesize new chemicals to inwove 
print quality 

To ensure the safety and marketability of our print cartridges worldwide, all new 
ink and print cartridge ingredients are screened far potential safety Itoxicologyl 
and regulatory concerns during the research and development phase. 

To begin with, all ingredients are screened to eliminate any ingredients that may 
elicit harmful toxic responses in humans and the environment The scientific litera- 
ture is searched to determine what is known about the ingredients Structural 
activity relationships are used to estimate the toxicity of newly synthesized chemi- 
cals In addition, a tier approach is used to determine the scientific tests to be 
conducted. The tier system consists of both Hewlett-Packard's safety standards 
and government regulatory requirements. 

Regulatory concerns for newly synthesized chemicals used in ink and print car- 
tridge development are more complex Since we sell our print cartridges world 
wide, we must obey all laws governing chemicals contained within an article — in 
this case the print cartridge These laws differ by country and we must understand 
and develop inks accordingly For example, in the United States our print cartridges 
are regulated by the Environmental Protection Agency (EPA) and the Consumer 
Product Safety Commission ICPSC) 

To sell punt cartridges worldwide, we must obey many rules and regulations For 
example, in the U S A , we must file a Premanufacture Notification (PMN) with the 



EPA before we sell a print cartridge containing a new chemical not listed on the 
United States master inventory The cost for filing a PMN is about $2,500 per 
chemical and there is a 90-to-180-day waning period lor approval There are 
substantial penalties for noncompliance. 

In Europe our print cartridge inks are regulated by the European Economic Com- 
munity. The EEC requires a series of scientific tests before we can market a print 
cartridge cnntainmg a new ink. These tests can cost from S50.000 to $1 50,000 per 
compound and take up to one year to conduct In addition, the Europeans have a 
45-day waiting period before allowing the sale of any new ingredients. 

Japan's laws (regulated by the Japanese Ministry] are slightly more favorable foi 
our print cartridges because they only require the registration of certain chemi- 
cals Since none of our inkjet chemicals are listed on Japan's list of "hazardous 
chemicals" and our print cartridges fall under an article exemption law. we do not 
have to register out newly synthesized ingredients. However, the ingredients must 
not impose any health risk to humans and the environment. 

Obviously, there are many more countries, laws, and regulations we must obey to 
sell our print cartridges worldwide Our print cartridge and ink ingredients must be 
safe and free of any legal restraints. By addressing safety and regulatory issues 
during the research and development stage, Hewlett-Packard intends to continue 
to meet the needs of its many markets worldwide. 
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Automated Assembly of the HP 
DeskJet 500C/DeskWriter C Color 
Print Cartridge 

Roughly 60% of the assembly technology had to be developed especially 
for the color print cartridge. Plastic welding, adhesive dispensing, TAB 
circuit staking, and ink fill were among the challenges. 

by Lee S. Mason and Mark C. Huth 



One of the key considerations in the development of the 
rolor print cartridge for the IIP DeskJet 500C anil Desk- 
Writer (" printers was time to market. Because of this, every 
effort was made to leverage the design and assembly 
technology of the existing black print cartridge where pos- 
sible, and to improve on the design where the opportunity 
presented itself. 

Fig. 1 shows an exploded view of the color print cartridge 
Fig. 2 shows the assembly line for this cartridge. Many of the 
stations were leveraged to some degree from the black print 
cartridge assembly line. The chart below lists the stations 
that were heavily leveraged (less than 20"n of I he station 

redesigned i. i lerately leveraged t2n to 70* of the Station 

redesigned), and not leveraged (essentially a new design). 



Heavily 


Moderately 


New Design 


Leveraged 


Leveraged 




liai < ode Libel 


0-1 Transfer 


Nose Load 


Head Attach 


Dale Code 


Body Load 


Head ( 'lire 


Print Test 


Nose Weld 


TAB Prep 


Offload and Tape 


Filter Stake 


TAB Attach 




Filler Stake Inspeci 


TAB Bond 




Pail Offload 


TAB Wrap 




Adhesive Dispense 


FY Cure 




Adhesive Inspect 


ErTesl 




Cheek Stake 


1-2 Transfer 




Side lleatstake 


P«l Seal 




Encapstilant 


( lean and Dry 




1 lispense 


Print Test 




Foam Feed 


Transfer 




Foam Stuff 



Lid Load 
Lid Weld 
Ink Fill 
( lap Load 

i ap Weld 



Areas of Improvement 

The manufacturing development team Identified six areas 
where there was significant opportunity lo improve a pro- 
cess from Ihe way il was performed for the black prim 
carl ridge assembly. These areas are: 



• Cartridge Body Preassernbly. A new subsection was 
added lo the assembly line for preassemhly of the two-part 
cartridge body and the Tillers. 

• Bulk Pan Feeding. Plastic parts, filters, and loam blocks are 
automatically fed into the assembly tooling from bulk pan 
hoppetS or bowls. These pans are delivered to the produc- 
tion line in bags and loaded into the hoppers as needed. 

• Adhesive Dispense. Positioning and bead width conlrol 
required substantial improvement for the color print 

i an ridge. 

• TAB Circuit Staking. To simplify the process for attaching 
the flexible TAB (lape automated bonding) circuit to the 
cartridge body, a direct staking process is used, eliminating 
the need for adhesive belvveen the Ivvo parts. 

• Ink Fill. The ink fill process was redesigned to allow more 
conlrol of Ihe process parameters. 

• Throughput and I tili/.alion To make Ihe assembly line 
more efficient, the fastesl conveyor speed available is used, 
and Ihe conlrol of pallets through each assembly tool is 
handled by Ihe process controller resident in the tool itself 

rather than a central One controller. To improve line utiliza- 
tion space for pan buffering belvveen assembly processes 
was designed in, allowing pails lo accumulale during short 
periods Of tool downtime. 

Production Line Layout 

The basis for the color prinl cartridge production line is a 
"power and free" Style conveyor system. In litis system, Ivvo 
narrow, parallel conveyor bells run continuously. Square 
pallets rest freely on the belts, which move the pallets be 
Iween assembly tools. These pallets contain lixturing lhal 
holds the print cartridge parts during assembly. All assembly 
and inspection operations are completely automated W illi 
the exception of prebond inspection. Operator responsibili 
ties include material loading, finished product unloading, 
and tool and process monitoring. 

The color prinl cartridge production line has I hi ee subsec- 
tions referred to as Assembly 0, Assembly I. and Assembly 2 
(see Fig. 2). Each section uses its own unique pallet to fix- 
lure the print cartridge parts slightly differently, as required 
for thai sections sequence of assembly. 
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Assembly 0 

This section of the line has the tools necessary for print car- 
tridge body preassembly. It consists of two parallel and adja- 
cent Conveyor sections approximately :J5 feel long. One con- 
veyor moves Assembly 0 pallets between tools and the 
adjacent conveyor carries pallets with completed plastic 
body preassemblies back to the start of the next subsect ion 
of the assembly line. 

The processes that occur during plastic body preassembly 
are nose load, body load, nose weld, filter slake, filter stake 
inspect, and fail offload. 

At the nose load station, a plastic nosepiece is loaded into 
an empty Assembly (I pallet. Vibratory bowl and inline feed- 
ers move the parts to a pick position, where a cam-actuated 
pick-and-place device grasps a nosepiece and places it into 
the pallet. A bulk hopper and elevator system is used to 
regulate the part level in the vibratory bowl. 

The body load station loads the plastic body pail into the 
Assembly 0 pallet on lop of the nosepiece. It is virtually 
identical to the nosepiece load station except for the size 
and shape of the pari being fed. 

The nose weld station ultrasonically joins the plastic body 
and nosepiece. 

At the filter slake station, a stainless sleel mesh filter is 
heatstaked to the plastic body at the inlet to each of the 
three ink channels. Three parallel filter feeding and slaking 
processes are used to accomplish this. In each process, the 
filter is first fed using a vibratory bowl to a pick position 
where it is picked using vacuum and presented to a heat- 
staking head by a pneumatically actuated pick-and-place 
device. The heatstake head then moves down Into the plas- 
tic body and thermally joins the filter to the inlet of the ink 
channel. 

After the ihree filters are staked into the plastic body, ma- 
chine vision is used to inspect the quality of the process 
automatically. The filters are inspected for presence and 
location, and the completeness of the weld is determined. 
For a more detailed description see the ailicle "Machine 
Vision in Color Print Cartridge Production," page 87. 

The last process in the Assembly 0 subsection is the fail off- 
load station. Here any plastic body preassemblies that have 
failed filter Inspection are ejected from the Assembly 0 pallet 
into a scrap container. 

Assembly 1 

Assembly 1 consists of the operations that are performed 
with the nozzle (or prinlhead) side of the pen facing up. This 
subsect ion is configured in a large rectangle about 50 feet 
long and about 10 feet wide. 

The stations on the .Assembly 1 portion of the line are 
0-1 transfer, bar-code label, adhesive dispense, adhesive 
inspect, head attach, head cure, date code, TAB prep, TAB 
attach, cheek stake, TAB bond, TAB wrap, side heatstake, 
encapsulant dispense, (JV cure, and E-test. 



Fig. 1. Exploded view of the color print cartridge. 
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The 0-1 transfer station uses a pneumatically actuated robot 
to remove the pen bodies from the Assembly 0 pallet, rotate 
them 180 degrees and place them onto the Assembly 1 pal- 
let. The Assembly 1 pallet uses a set of two metal locating 
mandrels which fit inside the cavity of the pen body assem- 
bly to locate two pen bodies precisely on each pallet. As- 
sembly 1 is the only subsection of Ihe line that has two pens 
on a single pallet. This lowers the cycle time. The cartidge 
orientation necessary for the Assembly 0 and Assembly 2 
operations prevented having more than one cartridge on a 
pallet. 

The labeling machine applies a bar-code label to each pen 
body assembly. The bar-code label is used in downstream 
processes. 

The adhesive dispense station uses x, y, and X servo-driven 
positioning tables and a metering pump to dispense the 
adhesive thai secures the printhead to the plastic pen body 
assembly. Adhesive location and thickness are critical to 
proper adhesion of the printhead with no leaks hetween 
chambers. ( Interchamber leaks cause contamination of the 



ink supply and will give unexpected colors. ) For a more de- 
tailed description of the adhesive dispense operation, see 
the article on page 84. 

The adhesive inspect station uses a machine \ision system 
to verify that the adhesive pattern is complete and free of 
voids. For a more detailed description of this operation, see 
the article entitled "Machine Vision in Color Print Cartridge 
Production." page 87. 

The head attach machine consists of several high-precision 
servo-driven positioning tables, a machine vision system, 
and a system controller. The machine remov es the printhead 
die from a film frame, and using machine vision, aligns the 
pen body assembly with the die. The die is then placed into 
the adhesive previously dispensed by the adhesive dispense 
station. This machine must locate the printhead with respect 
to the pen body assembly within extremely tight tolerances 
to ensure proper operation of the print cartridge. For a 
more detailed explanation of this piece of equipment, see 
reference 1. 




Ancusi 1002 Hewlett-Packard Journal 79 



© Copr. 1949-1998 Hewlett-Packard Co. 



The head c ure station thermally cures the* ailliesive that at- 
taches the printhead to the plastic pen hody assembly. The 
head cure station consists of several identical stations in 
series, and the adhesive is partially cured at each station. 
Acceleration and deceleration of the pallets must be carefully 
controlled in the head cure stations to prevent printhead 
shift while the adhesive is in its uncured stale. 

The date code station uses an inkjei printer to imprint a dale 
code onto the plastic pen hody assembly. The date code is 
used within the Inkjet Components Division to track date of 
manufacture, thin-film printhead lots, yields, and other 

information. 

The TAB prep machine unrolls the raw TAB stock front a 
reel, uses machine vision to inspect the part, and punches 
and presents good pails to the TAB alignment machine. 

The TAB attach machine uses machine vision and servo- 
driven positioning tables to align the pen body assembly 
precisely with the TAB circuit. The TAB circuit provides the 
electrical interconnect between the pen's printhead and the 
printer. Once the alignment has been performed, the TAB 
circuit is lacked into place using a healed probe. 

The cheek stake tool uses a heated platen to melt the plastic 
nose beneath the TAB circuit to attach the cheeks of the 
TAB circuit permanently to I he pen body assembly. The plat- 
en is compliant, which is necessary to ensure that the entire 
surface of the TAB circuit adheres to the pen body assembly. 

The TAB bond machines ultrasonically weld the "beams" of 
the TAB circuit to the interconnect. The station uses machine 
vision to align the ultrasonic probe with the TAB circuit. 

The TAB wrap tool engages the tooling holes in the TAB 
circuit, folds the TAB circ uit over 90 degrees, and holds it in 
position while- it is tacked into place using a healed probe. 

The side healslake tool is similar to the cheek slake tool 
described previously. It uses a heated platen to melt die 
plastic pen body beneath the TAB circuit to attach the side 
of the TAB circuit permanently to the pen body assembly. 
The platell is compliant, w hich is necessary to ensure I hat 
the entire surface of the TAB circuit adheres lo the pen body 
assembly. 

The encapsulanl dispense station is very similar lo the adhe- 
sive dispense station. Il uses x. y, and z seivo-diiven posi- 
tioning tables and a metering pump lo dispense the adhesive 
that protects the exposed beams of the TAB circuit. Reflec- 
tive laser sensors are used to locate each pen hody assembly 
precisely before dispensing the encapsulanl. 

The UV cure station cures the encapsulanl adhesive with 
Ultraviolet light. After cure, the bonds are stressed lo break 
any TAB bonds that might otherw ise fail in the field. 

The E-test is Ihe final station in the Assembly 1 subsection. 
I "sing a spring probe block, il electrically verifies all of the 
interc onnections between the printhead and the TAB circuit. 

Assembly 2 

The last subsection of ihe assembly line consists of assem- 
bly and test operations performed with the printhead facing 



down and Ihe ink containment c avities facing up. Assembly 
2 consists of a conveyor loop approximately 15 feet long and 
10 feet wide. One cartridge is held on each pallet. The opera- 
tions I hat occur in Assembly 2 are 1-2 transfer, pen seal, 
foam feed, foam stuff, lid load, lid weld, ink fill, cap load, 
cap weld, print lest transfer, print test, clean and dry, and 
offload and tape. 

The 1-2 transfer station uses a pneumatically actuated robot 
to remove print cartridge assemblies from the Assembly 1 
pallet, rotate them ISO degrees and place them into the 
Assembly 2 pallet. 

The pen seal station uses a pneumatic cylinder lo ensure 
thai the print cartridges are completely sealed within the 
pallet. This prevents damage to the downstream tools. 

The foam feed system consists of a bulk hopper, a eiealed 
bell elevator, and vibratory feeders, which deliver foam 
blocks Icj ihe three inlet positions of the foam Stuff machine. 
The foam blocks are dropped into the compressor section of 
the tool, which compresses the foam in two axes. A ram 
then pushes the foam blocks up into three sheet-metal tubes 
with rectangular cross sections. The lube set is then moved 
down into the pen body and a ram drives Ihe foam out of Ihe 
lubes and into the pen body. The motions of Ihe tubes and 
Ihe nun during the stuffing sequence are precisely controlled 
by motor-driven cams. 

At the lid load station, the plastic lid is loaded onto the top 
of the cartridge lo cover the foam blocks. Vibratory bowl 
and inline feeders move the parts lo a pick position when- a 
cam-actuated pick-and-place device grasps the lid and places 
it onto the cartridge- A bulk hopper and elevator system is 
used to regulate the pan level in Ihe vibratory bowl. 

The lid weld station ultrasonically joins the plastic lid and 
body. 

The ink fill process is Initiated by forming a chamber around 
the print can ridge and evacuating Ihe chamber to a high 
vacuum. Ink is pumped into Ihe cartridge using hollow 
needles, which pass through the- ihree holes in the plastic- 
lid. 

The cap load suit ion places the plastic cap onto the top of 
the print c an ridge assembly. The purpose of ihe cap is to 
provide a handling surface for Ihe user and to minimize 
water vapor loss from the cartridge over its life. This station 
is similar to Ihe lid load station and the other plastic part 
loading stations. 

The cap weld station ultrasonically joins Ihe plastic cap and 
body. II is the last assembly process on the line. 

The print lest system is used to evaluate Ihe print quality of 
the cartridge. Each cartridge is transferred to a four-position 
rotary table to allow multiple operations to occur in parallel. 
At one position on the rotary table, electrical contact is 
made with the print Cartridge and a test pattern is fired 
while a si rip of paper is passed beneath the cartridge. The 
lest patten) is then moved beneath cameras and a vision 
system is used to determine if all nozzles are firing correctly. 
For a more detailed description of print quality testing see 
the article on page 87. 
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The cartridge clean and dry station uses a water loop, which 
contacts the printhead surface and cleans off the ink residue 
left behind during print quality testing. After cleaning with 
water, the printhead is dried with heated air. 

The last operations on the production line occur at the off- 
load and tape tool. Here the cartridges are first sorted based 
on the results of the print quality testing. Cartridges that 
have failed the print test are offloaded from the pallet onto a 
conveyor for removal by an operator. Cartridges that have 
passed print testing are offloaded from the pallet into a tool 
that tapes the printhead area to contain the ink during ship- 
ping and storage. These cartridges are then offloaded onto a 
separate output conveyor for removal by an operator. 

Continuous Development 

Many challenges were encountered during the Startup phase 
of the high-volume color print cartridge production line. 
Initial yields were very low on the high-volume line, roughly 
50% of what was seen on the prototype line. There were two 
main reasons for this discrepancy in yields. First, prototype 
yields were misleading because relatively few pens were 
built on the prototype line. Second, in an effort to get as 
many good lest pens as possible, pens were "nursed" through 
the prototype manufacturing process. Unfortunately, most 
of these special steps were not readily transferable to the 
high-volume production line. 

The low yields at initial line startup triggered an intensive 
series of experiments to determine how yield could be im- 
proved. Based on the results of these experiments, several 
new processes were developed and new stations were 
designed and implemented on the assembly line. 

Many of these new processes are proprietary. < )ne that is 
not is the addition of pallet decelerators in numerous places 
around the line. The production line uses very small, light- 
weight pallets, and to keep cycle times to a minimum, the 
line uses the highest -speed Bosch conveyor. Light pallets 
and fast belts combine to transfer large shock loads to the 
pans and may shift the positions of any loose parts. The 
pallet decelerators reduce these shocks and allow us to 
benefit from lighter pallets and faster bells without 
compromising pen quality. 

Impact of Leverage 

As mentioned earlier, some portions of the color print car- 
tridge design were heavily leveraged frOm the previously 
introduced black print can ridge. Our team learned that 
when it comes to leverage, more is better. The portions of 
the assembly technology that were not leveraged from the 
black cartridge (roughly f>0% were not ) provided the most 
challenges and headaches. 

For example, the development of a two-piece nose and body 
that can be ultrasonic-ally welded together to form the three 
separate ink channels of the printhead ink path was ex- 
tremely challenging (see "Color Inkjet I'rint Cartridge Ink 
Manifold Design." page S2). The plastic material was diffi- 
cult to weld. So much energy was required to produce the 



weld that the weld horns would stress crack after a few 
hundred parts. Fabrication tolerances of the two molded 
plastic parts were very tight and critical to achieving 
leak-free welds. 

The process tolerances for this product required a total re- 
design of the adhesive dispense equipment. The equipment 
was custom-designed in-house (as was most of the color 
print cart ridge assembly equipment i because no commercial 
dispense systems could meet die critical locational toler- 
ances. The color print cartridge also requires different types 
of adhesives because die color inks are not compatible with 
the adhesives used on the black print c artridge. 

This print cartridge was the first to use heatstaking to ad- 
here the TAB circuit to the plastic body (The black print 
cartridge uses patches of die-cut hot melt adhesiv e to bond 
the TAB circuit to the body. ) The compliant design of the 
cheek and side heatstakers allows the entire surface area 
of the TAB circuit to adhere to the body, even without the 
compliance provided by the soft hot melt patches. 

Because of the three different-colored inks and three foam 
blocks, a special approach was necessary in the foam stuff 
process. The foam stuff machine was totally redesigned lo 
accomplish the compression and simultaneous insertion of 
the three small foams. 

The formulation of the colored inks presented numerous 
challenges to the ink fill process. During ink fill, the print 
can ridge is filled with ink and the ink front moves through 
the ink channels to the printhead. The print Cartridge is first 
placed in a chamber and the air is evacuated. The vacuum 
level must be precisely controlled during the entire opera- 
lion or one of several failure modes may be induced. The ink 
may froth as it is pulled past the print cartridge's fillers. This 
produces a print cartridge that is prone to "drooling." 
( I trooling is ink dripping out of the printhead without the 
print cartridge's being fired.) The print cartridge may con- 
lain air bubbles and fail the print quality tesl. The ink must 
be removed from the Surface of the printhead quickly as the 
ink front moves past the nozzles. If it is not. the inks will 
mix on the surface Of the printhead and contaminate the ink 
supply. Dozens of enhancements were made alter the ink fill 
station was online as we gained a heller understanding of 
the process. 

The last major deviation from the process for the black print 
cartridge involves the nozzle taping operation. The tape re- 
quirements for the color print cartridge are much more re- 
strictive than for the black print cartridge. First, a tape leak 
in the color print cartridge will result in ink mixing, which is 
a functional failure. In the black print cartridge a tape leak is 
only a cosmetic concern. Second, the primitives (groups of 
ink nozzles) in the color print cartridge are very close to- 
gether and close to the edge of the printhead. These factors 
make tape leaks much more probable with the color print 
cartridge. Proprietary process changes in I he manufacture 
and application of the tape allowed us to solve these taping 
problems. 
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Color Inkjet Print Cartridge Ink Manifold Design 



In a disposable printfiead, the ink reservoir, the ink plumbing, and the ink firing 
device must reside within the print cartridge. One of the challenging aspects of 
the print cartridge design for the HP DeskJet 5Q0C and DeskWnter C printers was 
the creation of the manifold required to deliver ink from the foam ink reservoir to 
the prmthead This article deals with the pans and processes used to create this 
manifold. 

Ultrasonic welding is used to join the two parts that form the manifold This pro- 
cess had been used on the previously released black mk|et pen to attach the 
plastic cap to the plastic body. Building on some of the prior welding experience, 
a joint was designed. 

Joint Design 

The joint is a double shear joint, mainly because the literature strongly suggested 
that shear joints are the best joint design for obtaining a hermetic seal In designing 
an ultrasonic weld joint, three basic requirements must be met 

• Small initial contact area between the two parts. This concentrates the energy to 
allow the material to melt faster, reducing weld time 

• Uniform and intimate contact. The entire weld pint should be in uniform contact 
with the mating pan This allows uniform energy distribution to the entire weld 
loint. 

• A means of alignment If the joint does not provide a means for aligning the parts, 
some other means (e.g., pin and socketsl must be used 

Fig. 1 shows the weld joint cross section geometry. The small initial contact area 
is obtained by having the sharp corners of the male piece (nosepiecel interfere 
with the chamfer in the smaller opening of the female piece (print cartridge body! 

Tight molding tolerances are required on both parts to maintain uniform and inti- 
mate contact Fig. 1 also shows how the manifolds are formed by the weld joint. 
This weld line is not a simple straight line but a rather complicated irregular 
shape. Obtaining the tight design tolerances challenged the technology of multiple- 
cavity mold generation and plastic injection molding All plastic parts were mod- 
eled as three-dimensional solids using the Hewlett-Packard ME 30 system. The 3D 
bodies were used to generate the tool paths for CNC-machining the electrodes 
used to make the molds. 

Part alignment is obtained by the fit of the male piece into the chamfer on the 
female portion, which accurately locates the two parts with respect to each other 

Equipment 

The welder consists of a oower supply and an actuator The power supply converts 
line voltage. 220V at 50 Hz, to 1 ,000V at 20 kHz. and also contains the controls for 
setting weld time or weld energy and trigger force. The 20-kHz electrical signal is 



converted to mechanical vibration by a piezoelectric transducer called the convert- 
er The actuator holds the convener, booster, and horn. It also provides the con- 
trols for down speed and ram air pressure, and houses the encoder for determin- 
ing horn location The actuator is mounted on a rigid stand While pans are being 
welded, they are held by a nest attached to the production line 

Weld Theory 

Ultrasonic welding works by causing friction between two interfering plastic parts 
This friction is present on Doth the microscopic and macroscopic levels. The heat 
generated by the friction melts the plastic at the weld joint While the plastic is 
molten, further vibration causes polymer chains of the mating parts to intermingle. 
As the plastic cdoIs, the two pans became one When an ultrasonic weld is done 
correctly, the physical strength of the joint approaches that of the parent material. 
There are many benefits to using this method for joining plastics, not limited to 
the following list 

• It does not require the use of solvents or adhesives (no need for added 
ventilation). 

• Part count is reduced (no need for gaskets, mechanical fasteners, etc ! 

• It is easy to integrate into an automated assembly process (microprocessor 
control, fast cycle time). 

• The highly localized heating results in little or no part deformation 

Process Variables 

The mechanical energy delivered to the parts can best be described as a function 
of force, amplitude, frequency, and weld time. Force is dependent upon air pres- 
sure, ram area, down speed, and trigger force Air pressure can be adjusted via a 
regulator flam area is a constant Down speed is adjusted by regulating the flow 
on the backside of the ram Trigger force is read from a transducer, and the trigger- 
ing level can be varied and controlled by the power supply The amplitude present 
at the horn/part interface is defined by the equation 

Amplitude ■ (Converter Output) x (Booster Gain) x (Horn Gain) 

Convener output is constant at 0 0008 inch Booster gain can be varied m discrete 
amounts Horn gam is constant and depends jpon the horn design Frequency is a 
constant, in tins case 20 kHz, although welders can be purchased with different 
frequencies. Weld time is a variable that can be adjusted and controlled by the 
power supply Therefore, the way in which energy is delivered to the plastic parts 
can be varied by as many as five independent variables. 

One of the more important parameters affecting weld quality is the amplitude of 
the mechanical energy The "stack" used to deliver the weld energy to the plastic 
parts consists of the convener, the booster, and the horn The horn is an acoustically 
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tuned pan that transfers the vibration into the plastic pans The booster >s an 
acoustically tuned coupler used to connect the hom to the converter The size and 
shape pi a Booster affect its gam Standard Booster gams range from 0 5 to 2 5 
Tne amplitude can oe varied 0-/ interchanging boosters with differing gams. As 
mentioned previous'*, the convener is a transducer that converts the electrical 
signal to a mechanical vibration 

Process Development 

To define a process for welding these parts R was necessary to understand how 
each variable affected the quality of the weld Moreover, a methoo for quantifying 
the quality of the weld joint was needed The quality of the joint was measured by 
how well it met the pan objectives, namely parallelism and leak tightness. The 
parallelism was measured with a drop indicator, a fairly trivial task Deciding how 
to check for leaks, however, became a long affair with many unacceptable meth- 
ods tned. At long last, 't was decided that a sensitive pressure leak tester would 
be sufficient for leak testing The quest became one of correlating the inputs 
(pressure, weld time, down speed, trigger force, and booster) witn the outputs 
(leak rates in three chambers and datum parallelism). One of the largest advance- 
ments in process development came from using an experimental design and sta- 
tistical analysis program to model the correlation This program proved to be 
invaluable in choosing the parameters for the process 

The use of the program was an iterative process A very broad-based experiment 
was first designed to determine which variables and interactions were significant 
These results often yielded trends m variables that would further limit the possible 
operating conditions. These possible operating conditions were then used as center 
points for narrower experiments This iteration continued until an acceptable 
operating condition was attained 

Pitfalls 

This has not been an easy road There have been some pitfalls associated with 
ultrasonic welding as well These include horn manufacture and reliability, and 
tight plastic molding tolerances. 

As described above, the horn is an acoustically tuned pan used to transmit the 
vibratory energy into the plastic parts The two metals commonly used for horn 



manufacturing are aluminum and titanium because of their good acoustical prop- 
erties During a horn's use. it undergoes longitudinal flexure and compression The 
stresses involved depend upon the amplitude erf the strain wntwi a horn element 
Homs are usually manufactu'ed to handle half-waveiengtn and full-wavelength 
vibrations Half-wavelength homs have one node, while full-wavelength norns 
nave two nodes During viorati&n. the material at or near a node undergoes me 
largest stresses The larger the stresses, the shorter We horn life, as is the case 

This stress riser proved to be the source of fatigue cracking m these homs By 
changing tne design to a fuli-wavelengtn hom. we moved the stress nser to a 
more benign location, farther away from a nodal region The full-wavelength hom 
was chosen as the long-term solution for the manufacturing process 

Further problems were encountered because of plastic pan variation When the 
multiple-cavity molds were brought up. there were numerous permutations of 
nosepiece and body combinations Of an the possible pan combinations, only a 
small fraction worked with the existing process Finding a process that wpuld 
work for all pan combinations was extremely challenging. Tnere are three pos- 
sible alternatives for finding a solution to this problem make the pans closer to 
the same Idecrease the tolerances for acceptability), find a very wide process 
window that will weld parts that are not quite identical, or some combination of 
the two The third solution was chosen. Trade-effs in pan tolerances and process 
margins were made. By cnangmg certain pan tolerances, a process window was 
established that could weld all pan combinations. This challenge will remain 
whenever a new mold is brought on line, as long as this joint design is in use 
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Adhesive Material and Equipment 
Selection for the HP DeskJet 
500C/DeskWriter C Color Print 
Cartridge 

The adhesive joins the printhead to the cartridge body and maintains color 
ink separation at the interface. The encapsulant protects the electrical 
bonds. Special equipment was designed to dispense these materials with 
high precision in very small volumes. 

by Douglas J. Reed and Terry M. Lambright 



Structural adhesive is the material that creates the bond 
between (he active portion of the print cartridge (the 
printhead) and (lie passive portion (the tart ridge body). 
Adhesive requirements tor the three-color IIP DeskJet 
500C'/Desk Writer C print cartridge are Tew, but critical. Suffi- 
cient bond strength musi be provided overall environmenlal 
conditions throughout the entire lifetime of the print car- 
tridge ( a maximum shelf life of eighteen months and a maxi- 
mum in-print er service life of six months). This requirement 
is very similar to the corresponding requirement for (he orig- 
inal monochrome ink cartridge introduced with the III' 
DeskJet printer. However, the three-chamber design of the 
color cartridge adds the important function of maintaining 
color ink separation at the interface between the printhead 
and the cartridge body. 

The primary role of (he encapsulate is lo provide protection 
for the bonds that are created between (he TAB (tape auto- 
mated bonding) circuit leads and the bond pads on the print- 
head. The TAB circuit is the device that carries electrical 
signals from the printer to the resistors on the printhead. 
The encapsulant also coats and protects the portion of the 
TAB leads that spans the gap between the edge of the TAB 
circuil and the edge of the printhead. 

Stress Tests 

To decrease product development test lime, various acceler- 
ated stress tests were used to verify the acceptability of the 
adhesive systems over the projected lifetime. These stress 
tests exposed the adhesives (and the entire print cartridge in 
some cases) to mechanical, (hernial, and other environmen- 
tal s(resses (hat were meant lo reveal weak points in the 
design and assembly of the print cartridge. Because (he ink 
is in coniacl with the adhesives during these tests, chemical 
interactions with the ink components are also accelerated. 
The rigorous test regimen used in this project was helpful in 
understanding the root causes of various failure modes. 

Cycle-Time Constraints 

To obtain (he desired manufacturing cycle time, it was nec- 
essary that the Structural adhesive and the encapsulant 



reach full cure in a relatively short time. Material selection 
was therefore restricted to adhesives that could be rapidly 
cured by an appropriate energy source. Multiple stations 
were necessary to obtain the required throughput, but by 
selecling quick-cure adhesives, cost was minimized by 
reducing the total equipment needs. 

Compatibility with Machine-Vision Inspection Systems 

During the manufacturing process, the bead of Structural 
adhesive is inspected by machine vision to verify the pres- 
ence and the quantity of adhesive in critical regions (see 
article, page 87). As a result, it was necessary to select a 
material that has high visual contrast with (he pen body so 
the vision system can process the image. In fact, the final 
material selection was partially based on this requirement, 
since I he compel ing materia] had very little contrast with 
the black print cartridge. 

Mechanical Properties 

Mechanical properties are primarily determined by the adhe- 
sive type and the degree of cure developed during the as- 
sembly process. Because of manufacturing constraints and 
the asymptotic nature of chemical reactions, it is rarely pos- 
sible (or even desirable) to obtain 100% cure. In practice, the 
adhesive is cured to less than 100% because (he desired level 
of mechanical and chemical properties can be obtained 
under those conditions. 

To select the proper structural adhesive for the color print 
cartridge, several thermoset systems were evaluated. All of 
them were fast, thermal-cure sysiems (ypically used in the 
electronics industry. The primary' differences between the 
competing systems were the type of catalyst and the level of 
additives, fillers, and adhesion promoters. These different 
additives provided a range of adhesion levels, solvent 
resistance, cure rates, and oilier characteristics. 

A uniform layer of adhesive is more critical for the color print 
cartridge than for the monochrome application. This is be- 
cause the dispensed bead on the color cartridge is narrower 
and therefore the probability is higher that an adhesive defect 
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will allow ink leakage between colors or to the external en- 
vironment. The narrower head widths also make it neces- 
sary to have higher adhesion strength on a unit-area basis. In 
the three-channel design, no extra room is available to lay 
down additional beads to provide higher adhesion. As pre- 
viously mentioned, this initial adhesion had lo he main- 
tained as much as possible throughout the life of the print 
Cartridge. Adhesion retenlion is a direct (unction of the 
cleanliness of the substrate (cartridge). Ihe degree of cure, 
and the effect of long-term interactions with the ink. 

The encapsulant materials that we investigated included two 
different chemical families. The material used on Ihe origi- 
nal monochrome cartridge has some advantages in terms of 
good adhesion lo the prinlhead. However, its chemical resis- 
tance is not great enough to withstand the new ink chemis- 
tries introduced with the color prtnl cartridge. The new en- 
capsulanl has better chemical resistance even though it is 
slightly softer and has lower adhesion than the earlier mate- 
rial. This encapsulant surv ived the rigorous si rcss lesls 
described above, which destroyed Ihe previous material. 

interaction with ink Components 

Many of ihe adhesives met all of Ihe performance require- 
ments immediately after cure, but il was crucial lo deter- 
mine how the original material properties were degraded 
after exposure lo inks and lo ink components. To evaluate 
the chemical interactions, small rectangular blocks of each 
adhesive were poured, cured, and then exposed lo ink under 
various environmental conditions. 

As shown in Fig. 1. Ihe tesi blocks revealed significant in- 
formation about how Ihe materials performed alter immer- 
sion. Information gathered from this lesi and others enabled 
us lo make two critical determinations: ( 1 ) Did the ink leach 
sufficient constituents out of ihe adhesive lo affect ink 
quality?, and (2) Did the adhesive absorb sufficient ink 
Components lo significantly affect Ihe mechanical and 
dimensional properties of Ihe adhesive? 

In some cases, Ihe solvent cotnponenls of Ihe ink leached 
polymeric constituents from the adhesive causing Ihe ink to 
become contaminated and Ihe adhesive lo lose weight. In 
oi her cases, weigh! gain occurred because critical ink com- 
ponents were absorbed by Ihe adhesive and depleted from 
ihe ink. In both cases, il prov ed necessary to evaluate the 
Interactions by performing prinl quality lesls. 



Mechanical degradation of the adhesive was also caused by 
ink interactions. These effects were exhibited as adhesive 
swelling, loss of hairiness, stress cracking, and crazing. 
Again, the largest factor here was material type. Postimmer- 
sion push tests were used to verify adhesion and integrity of 
the structural adhesive when tested on cartridge bodies. 

Dispensing System Design 

Applying structural adhesive and encapsulant to the color 
prinl cartridge presented significant design challenges 
compared to the original DeskJet print cartridge applic alioti. 
The trichamber design of the color cartridge and the two- 
piece assembly process placed more stringent demands on 
the dispensing systems. This provided the opportunity to 
advance our capabilities in small-volume dispensing technol- 
ogy and allowed us to build upon our previous manufactur- 
ing experience with inkjet products. After searching for out- 
side vendors ihai could meet ihe requirements, it became 
obvious thai we would need to design tools in-house to meet 
ihe required degree of accuracy. Inkjel technology has con- 
linually pushed Ihe limits of standard adhesive technology 
and dispensing requirements. This new opportunity de- 
manded that we design superior dispense systems thai 
would lend themselves to our unique applicaiions and allow 
us to leverage those capabilities into future manufacturing 
processes. 

Structural Adhesive Dispensing Requirements 

As mentioned earlier, the structural adhesive had the dual 
role of bonding the prinlhead lo the cartridge and maintain- 
ing physical separalion between the three ink channels and 
between the ink channels and Ihe oulside world. These Func- 
tions had lo be accomplished in an area I hat is no larger 
than thai available on Ihe original monochrome can ridge 
used in the DeskJet family of printers. Compared to pre- 
vious products. Ihe required bead width was approximate!) 
one-third smaller and Ihe placement tolerance was aboul 
tWO-thirdS smaller. 

Encapsulant Dispensing Requirements 

The encapsulation process is also more stringent than its 
predecessors. Previous applications controlled only encap- 
sulation height and encroachment onto the orifice plate. 
However, lo permit a differenl sealing mechanism in the 
printer Service Station, the color cartridge requires control 
on all sides Of the dispense pattern, In addition to control- 
ling orifice plale encroachment (for proper printer wiping) 
and height (to prevent crashing into the paper), ihe color 
cartridge has the additional constraint of maintaining a 
"free" area outside the encapsulation zone to allow access 
by the printer service station. This means thai Ihe process 
specifications are a factor of iwo lighter than what had pre- 
viously been required for encroachment as well as requiring 
dimensional control in five directions (both sides, both ends, 
and height ) instead of Ihe previous Iwo. The final problem 
for Ihe dispensing system was dial Ihe cartridge body is as- 
sembled from two pieces — a nosepiece welded loa body. 
This prov ided ihe challenge of an increased lolerance slack- 
up when ihe pari was presenled lo the dispenser and made 
il likely that Ihe dispensed surface would not be flat. A 
background concern affecting all of Ihe Others was ihe 

knowledge thai a sleep production ramp would necessitate 

low cycle limes and high reliability. 
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Tooling Objectives 

The primary objectives for the tool included controllable 
adhesive volume delivery a multiaxis closed-loop control 
system, and a system to locate cartridge position. Means had 
to be provided (0 locate and calibrate the dispense tip so the 
system could perform without operator intervention. 

Adhesive dispense systems bom the electronics and auto- 
motive industry were evaluated, site \isils were conducted, 
and trade show information was digested. None of the sys- 
tems we evaluated would provide the tolerances or multiax- 
is synchronization that was required. While some of the dis- 
pensers in the automotive industry' had the desired 
closed-loop features, the differences in scale were just loo 
large to bridge. It was ultimately concluded that no other 
applications required the same combination of precision, 
speed, and small dispense volume. 

Production Equipment 

A CNC-type controller was selected to provide I he required 
four-axis synchronization. Cartridge location is determined 
by position sensors, allowing us 10 locale the pari in space 
in all three axes. Dispense tip location is determined by mul- 
tiplexing the vision engine thai was already in place for on- 
line inspection of the dispense pattern. With boih the car- 
tridge and I he dispense tip located in space, the dispense 
pattern can be adjusted for each individual can ridge. We 
selected a controllable dispense mechanism that is force-led 
from a pressurized syringe. This solution provides most of 
the benefits of true positive displacement even though it has 
the drawback of noi accommodating viscosity changes in 
the adhesive. Fig. 2 shows the adhesive pattern. 

The final syslem design meets all of the original demands 
thai were imposed by the color print carl ridge. It also goes a 




Fig. 2. Adhesive pattern dispensed hy the production equipment 



long way toward advancing our understanding of dispensing 
technology. Finally. 1 1 it • system provides us with the capabilit.v 
of modifying the design in the future to provide true positive- 
displacement dispensing for other applications thai require 
similar precision. 
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Machine Vision in Color Print 
Cartridge Production 



In production of the tricolor print cartridges for the HP DeskJet 500C and 
DeskWritei C printers, machine vision is used for filter stake inspection, 
adhesive and encapsulant dispenser calibration, structural adhesive 
inspection, and automatic print quality evaluation. 

bj Michael J. Monroe 



Machine vision can be described as ihe synthetic- acquisi- 
tion, analysis, and interpretation of images, usually to pro- 
vide feedback and control tor some automated activity. Ma- 
chine vision has been implemented through the marriage of 
video camera and display systems to computer technology, 
and it is often associated with some form of artificial intelli- 
gence. Most machine \ision applications entail massive data 
reduction from Ihe millions of bits that represent the images 
to often a single bit indicating a pass or fail status. 

Machine vision is usually used for machine or robot guid- 
ance, defect Identification and classification, part and as- 
sembly alignment, and feature measurement. Automated 
production environments are the ideal home for machine 
vision. There it can be applied to relatively simple, repetitive 
tasks, and cycle lime is of critical importance. These ap- 
plicaiions arc distinguished from the more generic digital 
image processing used in areas such as astronomy, biosci- 
ence, and satellite image enhancement, for which Ihe algo- 
rithms tend to be very CPU-intensive and exec ution t ime is 
of less importance. 

Increased Quality is usually Ihe primary motivation forau- 
tomaling a task with machine vision. Machine vision can 
eliminate Ihe subjectivity often found in manual inspection 
operations. Quality can also be increased through reduced 
inspection error rates by eliminating operator fatigue. The 
improvements in quality thai machine vision can help attain 

may be vital to the long-term competitiveness of a 

manufacturing operation, 
□lamination and Opt ics 

No discussion of machine vision would be complete without 
emphasizing the importance of illumination and optics. 
These aspects are critical to nearly all machine vision ap- 
plications, and indeed the viability of a particular applica- 
tion may hinge upon ihe design of the illumination and opti- 
cal systems. Careful design of Ihe illumination System for a 
particular machine vision application can provide enhance- 

meni of contrast or of certain features of interest In the field 

of view, or it can be used to filter out features that may con- 
fuse the algorithm. One obv ious advantage of image en- 
hancement through illumination is that it operates on the 
entire image instantaneously, much faster than any digital 
image processing. Techniques suc h as light and dark field 



illumination can greatly improve the contrast between fea- 
tures of interest and the bac kground. Linear or circular po- 
larization of the illumination or application of on-axis or 
off-axis illumination can mitigate the effects of specular 
reflection from metallic or polished surfaces, which can 
Often saturate the imaging sensor within the camera, caus- 
ing the resultant image to be of little use. In some applica- 
tions there is no usable contrast between the features of 
interest and the background, and imaging using illumination 
in the visible wavelengths is impossible. However, ultravio- 
let illumination, to which most cameras have reduced sensi- 
tivity, can induce some materials to fluoresce in the visible 
region of the spectrum, thereby providing usable images. 
Some machine vision applications require only the informa- 
tion that is contained in the outline or silhouette of the ob- 
ject of interest Backlighting or placing the objeel between 
the camera and the illumination source can provide images 
of suc h high contrast thai they are nearly binary in nature. 

Design of Ihe optical system is equally important. CCTVor 
('-mount lenses, commonly employed with standard video 
cameras, sue often used in machine vision. They are inex- 
pensive, but I hey provide resolution and contrast that are 
marginal or inadequate for many applications, l-enses de- 
signed for use with standard :i">-nuu cameras or phoioen- 
larging lenses are somewhat more expensive, but they pro- 
vide much higher resolution and contrast, and the control of 
aberrations such as distortion and field curvature is much 
better. For applications that require extremely high resolu- 
lion ami magnification, micrographic lenses are usually Ihe 
besl choice. The key parameters to be considered during the 
design Ofthe Optical system are the working distance or Ihe 
dislance between the lens and the object, the magnification, 
and the focal ratio. The working dislance and the required 
magnification are used to determine Ihe focal length of the 
leasing system, and the focal ratio is a function of ihe avail- 
able Ulumtnation. Other considerations such as shock and 
vibration and rigidity of the mounting systems are also very 
important. 

Soft w are 

The often crucial nature of the illumination and optical sys- 
tems design should not overshadow Ihe design of Ihe soft- 
ware that is to be run on Ihe machine vision engine. During 
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the development of most machine vision applications, the 
miyoriiy ofthe iirno is devoted CO the design, coding, and 
debugging Of this software, The software for a typical ap- 
plication can hp broken into five segments. First is image 
acquisition, which entails synchronization ofthe video 
source and then digitization and storage of the image data 
into frame buffer memory. The most common video source 
is a standard KS-170 monochrome camera, which transfers 
l he data representing a full frame in 33 milliseconds. After 
acquisition of the image, some son of enhancement of the 
features of interest is usually done. This may consist of high- 
pass or low-pass convolution filters for treatment ofthe 
edges ofthe features, or morphological filtering, which can 
eliminate noise pixels in an image, linage segmentation usu- 
ally follows to provide some means of separating the fea- 
tures of interest from the background or other extraneous 
parts ofthe image. Binarization or thresholding of the image 
is a segmentation technique used to convert an image repre- 
sented by many gray levels to one consisting only of regions 
of pure black or pure white. This method also greatly re- 
duces the amount of data that must be processed during the 
analysis ofthe image. Feature extraction is then used to 
derive data on the details of features of interest that w ill be 
used during the Interpretation phase ofthe algorithm. Analy- 
sis ofthe gray level sums ofthe pixels in all ofthe rows and 
columns of an image results in profile information such as 
the position, size, or shape of features. Template or patient 
matching uses a previously stored model to locate the posi- 
tion of a matching model In the acquired image. Connectiv- 
ity analysis provides detailed information on linked areas 
within a binary image. Morphological analysis can provide 
data on the shapes of various features of interest, in addition 
to its usefulness as a noise filter Finally, all ofthe acquired 

data representing the image must be interpreted so thai 

some useful result can be found. This may lake the form of 
calibration or position coordinates that can be used for ma- 
chine guidance, or it may simply be the pass or fail result of 
an inspection task. 

Machine Vision in InkJet Component Production 

Hewlett-Packard has used machine vision for years 10 im- 
prove efficiency and quality in the manufacture of integrated 
circuits, printed circuit assemblies, calculators, thermal ink- 
jel printheads, and many other products. The Inkjet Compo- 
nents Division has been a leader in the incorporation of Ibis 
technology into the production processes of its products. 
Beginning in 1983, machine vision has been used in the final 
assembly of the HP Thinklet print cartridge to inspect for 
defects such as poor Structural adhesive placement and 
leaks. In addition, samples from every prim cartridge are 
analyzed using machine Vision to screen for various print 
quality defects. These machine vision applications were fur- 
ther refined and new ones were dev eloped for die manufac- 
ture of the IIP DeskJet print cartridge. Machine vision is 
used to assisi very high-precision pan alignment and place- 
ment machines in the attachment of orifice plates to the 
thin-film substrates ofthe print head assembly and in the 
placement ofthe printhead assembly onto the plastic print 
cartridge body.' In addition, a new fully automatic high- 
speed print quality tester was developed using machine 
vision to inspect print samples of 101) percent ofthe print 
cartridges manufactured. 



Color Print Cartridge Production 

In the early lSXSOs. when general-purpose machine vision 
engines were first made available, many different manufac- 
turers offered products that were similar in function bin 
very different with respect to instruction sets and software 
development environments. Since that time, there has been 
a shakeoul in the market place, and fewer different machines 
are being offered today. However. I he products that remain 
still tend to be very proprietary in nature, and little if any 
effoil has been made to create the lype of standards (hat 
exist today in the computer market. This lack of standard- 
ization and especially the preponderance Of proprietary 
instruction sets can greatly reduce the efficiency with which 
new machine vision applications can be developed. There- 
fore, it is of great benefit to choose One product t hat has 
adequate capability for most if not all applications. This 
benefit of standardization is particularly important in die 
manufacturing environment, where machine vision applica- 
tions lend to proliferate and support becomes a significant 
issue. 

For the HP DeskJet "lOOC/DcskWriier C print can ridge pro- 
duel ion line we have chosen an engine that offers a range of 
Optional features and capabilities, but uses a common 
Instruction set and software development and debug envi- 
ronment throughout In a similar vein, a single line of very 
high-quality phoioenlarging lenses has been selected to pro- 
vide several focal lengths and focal ralios thai are suitable 

for most machine vision applications. 

Filter Stake Inspection 

Because the printhead for the Deskjet 500C printer is a tri- 
color printhead il has a Separate chamber for each of die 
cyan, magenta, and yellow primary colors. At the bottom of 
each of these chambers near I he nose Ofthe print cartridge 
there is a slaiuless-slcel filler screen that is healslaked to a 
standpipe. The filler slaking process Is a Critical one, and 
machine vision has been Incorporated to ensure that every 
print cartridge body coining from the filter staker tool has 
three fillers that are properly slaked to the siandpipes ofthe 
print cartridge body. Each filler is inspected for presence, 
proper centering on the standpipe, and an adequate weld 

ring Indicating satisfactory attachment to the standpipe. The 

filler staker consists of an indiv idual slaking station for each 
of I he three fillers, and each of these stations is equipped 
with camera, optics, and illumination to enable inspection 
immediately alter completion ofthe slaking operation (see 
Fig. 1). 

The filler slake inspection process is a good example of a 
machine vision application in which optics and illumination 
determine the viability ofthe process. Because of the flow 
characteristics of die plastic material used in the print car- 
tridge body, relatively little ofthe material is drawn up into 
the weave ofthe filler screen during die heatslaking pro- 
cess. As a result, images dial displayed little or no Contrast 
between the weld ring and die filler were all that could be 
achieved wilh nearly on-axis illumination, Il was found thai 
contrast could be enhanced somewhat by increasing I lie 
angle of illumination from Ihe normal lo I he filter. However, 
this angle was limited by the shadowing caused by the walls 
of the prim cartridge body. At this point, contrast was slill 

inadequate for reliable imaging, and further development 
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Fig. L the Bltsr stake ttapectfon station uses off-axis illumination 
and polurix.ing optica. 

was required. Il was determined that although the Biter 
screen material is relatively rough and provides reflection 
thai is primarily diffuse In nature, there is also a specular 
component that lends to saturate the Imaging sensor in the 
camera and thereby further reduce contrast. Therefore, lin- 
ear polarization of the illumination source is used to miti- 
gate t he effects of specular reflection and produce images 
adequate for a robust inspection process. The angular 
orientation of the filter screens greatly affects the reflec- 
tance of the material, and as a result, the images vary from 
brightness levels too low to be of use to levels so high as to 
cause saturation of the imaging sensor. In most machine 
vision applications, the camera's automatic gain control 
(AGC) is disabled so thai the gray levels digitized by the 
vision engine provide a more accurate representation of the 
object. However, it was found that the camera itself could 
be made to cope with the wide variations in brightness 
simply by reenabling the AGC. 

The inspection algorithm is based upon gray-level row and 
column profiling for determination of filler centering on the 
standpipe, and gray-level pixel sums and ratios of sums to 
ascertain filter presence and gauge the continuity of the 
weld ring. The inspection based upon the analysis of gray 
levels is very robust and provides false-pass and false-reject 
rates that are nearly zero. In addition, image acquisition and 
processing are extremely rapid, and a single machine \ ision 
engine can support all three of the inspection stations. At 
the completion of the inspections, the lest results arc sent to 
the PLC (programmable logic controller) that controls the 
filler staking process, and that data is then transferred to a 
failure off-load station via pallet code blocks. 



Adhesive and Encapsulant Dispensing 

The print cartridge for the HP DeskJet 500C printer contains 
three inks of different colors, and it is critically important 
Dial complete separation between these inks be maintained 
at all points throughout the print cartridge. Any ink mixing 
will result in contamination of the inks and of the colors that 
they produce on the printed page. The structural adhesive 
that bonds the thin-film printhead to the nose of the print 
cartridge plays a vital role in providing the required ink 
isolation. Because all three inks an? channeled to the same 
printhead, the tolerances associated with the placement of 
the partem of structural adhesive are more restrictive than 
those of other thermal inkjet print cartridges. As a resull. it 
was recognized that some means of providing an efficient 
and accurate calibration of the exact position of the needle 
tip of the adhesive dispenser was required. A similar require- 
ment existed for the encapsulant dispensing process. Ma- 
chine vision was chosen as an aid to ensure accurate place- 
ment Of the structural adhesive and encapsulant by providing 
the dispenser system controller w ith accurate x. y. and z 
coordinates of the location of the dispenser needle tip. 

Each of the dispenser stations is equipped with two orthogo- 
nally positioned cameras anil associated optics (Fig. 2). Dur- 
ing the calibration process the positioning system moves the 
dispenser needle to a calibration position so that images 
from both of the cameras can be captured by the machine 
vision engine. Through analysis of both of the images the 
vision engine can determine the x, y, and z coordinates of 
the dispenser needle tip in pixel space. Then the coordinates 
in pixel space are convened to coordinates in the coordi- 
nate space of the dispenser positioning system using calibra- 
tion coefficients previously stored in nonvolatile memory of 
the machine vision engine. This position data is then sent in 
the form of offsets to the dispenser system positioning 
controller via an RS-232-C datacomm link. The calibration 
process is fully automatic and requires operator assistance 
only for initiation of the procedure. It is normally performed 
only at the beginning of a shift or when a dispenser needle 
has been changed. 




Fig. 2. The adhesive and ttJicapsulattl dispensing stations are 
equipped with two orthogonally positioned cameras using back- 
lighting to determine the position of the dispensing needle tin 
accurately, 
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All of the data necessary to determine the precise position 
of the dispenser needle can be inferred from the profile or 
silhouette of the needle tip. This provides an ideal opportu- 
nity for backlighting or placing the object to be imaged be- 
tween the illnminalion source and the camera Backlighting 
typically produces an image of extremely high contrast, and 
often the image is nearly binary' with the object appearing 
dark against a bright background. Since the required accu- 
racy of I lie machine-vision-based calibration system is very 
high, a significant amount of magnification is required to 
achieve the necessary resolution. An appropriate photoen- 
larging lens with adequate extension was selected to provide 
the desired field of view. 

The machine vision engine uses an edge-enhanced pattern 
matching technique to locate the dispenser needle tip in the 
field of view to subpixel accuracy. The template or model 
describing the salient features of the object to be located is 
stored in the nonvolatile memory' of I he vision engine during 
a set up procedure. A unique model for each of the cameras 
is stored. These models are then used during each subse- 
quent calibration procedure to determine the locations of 
the desired features in the two images. Also during the setup 
procedure, each of the cameras is calibrated using a preci- 
sion reticle grid placed in precisely the same position as the 
dispenser needle. Nine intersections on the grid are then 
used as calibration points so that an array of coefficients 
can be produced and then used to convert coordinates in 
pixel space to machine coordinate space. The machine vision 
engine has dedicated RS-232-C datacomm pons connected 
to the adhesive dispenser system controller and the encap- 
sulant dispenser system controller. After the offsets in ma- 
chine coordinate space have been computed by the vision 
engine, they are sent to the respective controller via these 
links. Since these calibration procedures are performed on a 
fairly infrequent basis, speed of execution is not of para- 
mount concent. As a result, the two dispenser systems share 
one machine vision engine with the structural adhesive in- 
spection station. The automatic calibration process has 
proved to be robust and to provide precise and rcpeatable 
dispenser needle positioning data to the dispenser system 
controllers. 

Structural Adhesive Inspection 

Although (he machine-vision-based dispenser calibration 
process provides an efficient means of compensating for 
variation in needle tip positions, it cannot prevent other pos- 
sible defects in the structural adhesive pattern. As previous- 
ly stated, the integrity of this adhesive pattern is vital to the 
proper funct ioning of the print cartridge. Therefore, an auto- 
matic inspection station was developed using machine vi- 
sion to ensure that each print cartridge body has an accept- 
able and accurately placed pattern of Structural adhesive 
before reaching the head attach station. Because the adhe- 
sive is very soft and easily disturbed after leaving the dis- 
penser station, it was imperative that a noncontact means of 
inspection be used. The pattern of structural adhesive for 
each print cartridge is visually inspected for proper center- 
ing £pd area of the adhesive and for proper centering and 
area of each of the three channel openings. In addition, the 
patient is Inspected to ensure that the bead of adhesive is 
continuous and free of sections that are excessively narrow 
or wide. 




Fig. 3. I FhproceSSed image of the structural adhesive pattern. 



The structural adhesive that was originally to be used pro- 
vided virtually no contrast when placed on the black plastic 
nose of the print cartridge and illuminated with light in the 
visible portion of the spectnim. However, when ultraviolet 
light was used to illuminate the adhesive, fluorescing com- 
pounds contained in it caused it to appear light blue against 
the black background of the nose. This technique provided 
some usable contrast, but it was marginal and not likely to 
succeed with the variabilities commonly found in a produc- 
tion line environment. A new adhesive was selected. The new- 
material provides images of extremely high contrast and 
allowed the development of a very robust inspection pro- 
cess. Lighting for this application is prov ided by a standard 
fluorescent ring lamp, which supplies adequate illumination 
that is very uniform across the field of view. The standard 
photoenlarging lens provides excellent, zero-distortion 
images (Fig. 3). 

The most serious defect in the structural adhesive pattern 
occurs when an insufficient amount of adhesive has been 
placed by the dispenser station, and a break or discontinuity 
occurs in the bead. Continuous or connected regions in the 
image are most effectively evaluated through the application 
of connectivity analysis. Because the adhesive provides 
images of extremely high and consistent contrast, binariza- 
tion and the subsequent connectivity analysis have proved 
to be a very robust inspection technique. Indeed, the images 
are of such quality that adaptive thresholding, which is usu- 
ally required to compensate for variations in contrast and 
illumination, is not necessary. This results in a decrease in 
the overall cycle time for the process. Morphological filters 
are applied to the binary image to remove random noise 
pixels that could taint the connectivity data, and to reduce 
the width of the bead image so that portions that represent 
insufficient adhesive will actually separate and can be easily 
identified. The database produced from the connectivity 
analysis furnishes information such as area, centroid. and 
perimeter of the connected regions found in the image, and 
a description of the hierarchical relationship between those 
regions. Much of this data is easily compared with test limits 
to determine the adequacy of the adhesive pattern (see 
Fig. 4 ). However, because of the tight tolerances associated 
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Fig. 4. Structural adhesive pattern imago after processing and 
analysis, showing test data and results. 



with the adhesive placement and the positioning errors in- 
herent in the pallet lift and index mechanism, it is essential 
to locale a feature on the nose of the print cartridge with 
which to compare the adhesive pattern location. The central 
channel opening in the nose is located using edge-enhanced 
pattern matching before the connectivity analysis so Ihat its 
position can be used as a reference in the subsequent tests. 

Automatic Print Quality Tester 

The quality of the IIP DeskJet 500C print cartridges leaving 
the production line is of the utmost concern. This concent 
precipitated the development of the automatic print quality 
tester, on which 100 percent of the print cartridges manufac- 
tured are tested. The automatic print quality tester is the final 
ar&iter Of print cartridge quality, and it is its responsibility lo 
provide information regarding print cartridge performance 
so that the defective ones can be culled from those thai are 
acceptable at the end of Hie production line. The automatic 
print quality tester is capable of detecting a wide variety of 
print cartridge defects. Il is a fully automatic tester ihat re- 
ceives print cartridges seated in nests on a rotary table, fires 
I he print can ridges so that test patterns are printed on pa- 
per, and then uses machine vision to examine and evaluate 
the printed patterns to detect any possible defects in ihc 
print cartridges. At the completion of the tests for a given 
print cartridge, test results in the form of fail codes are both 
stored locally and sent to a data collection system so that 
yield and defect summary reports can be prepared. The local 
database can be interrogated using online utilities, and yield 
and defect distributions can be displayed in near real time. 

The automatic print quality tester uses a relatively straight- 
forward optical and illumination design that incorporates 
high-quality macroscopic lenses and fiber optic ring illumi- 
nators (Fig. 5). The challenge was how best lo deal with the 
cyan, magenta, and yellow primary colors. Standard color 
imaging techniques using a color video camera with RtiB 
outputs were considered but rejected because frame storage 
memory requirements were tripled relative to those for 
monochrome imaging, and most color cameras provide sig- 
nificantly reduced spatial resolution, in addition, before 
meaningful color image analysis can lake place. I he contents 



of the frame buffers thai contain the gray lev els representing 
the red, green, and blue video camera outputs must l>e com- 
bined and converted into a more useful color coordinate 
system such as HSI (hue-saturation-intensity). The time re- 
quired to make these conversions in software was prohibi- 
tive with respect to the overall cycle time of the tester. 
Because nozzle defects and not hue shifts are the most com- 
mon defect types, it was decided that true color imaging was 
not required, and an approach using color filters w as taken. 
If the machine vision engine is presented with images of 
high enough contrast with the ink appearing dark on the 
white paper background, it can easily perform the required 
analyses to determine the health of the print cartridge. The 
cyan and magenta primaries provide usable contrast without 
any special optical techniques, but the yellow ink is nearly 
invisible to the CCD camera. By placing a blue interference 
filter in front of the camera lens, the yellow test patterns are 
made to appear dark enough to provide usable contrast in 
lite image. A similar enhancement is afforded by a green 
interference filter on the camera that is used to image the 
cyan and magenta test patterns. It greatly increases the con- 
trast of the magenta patterns, but has little effect on the 
cyan. This technique in effect reduces the number of images 
that must be acquired for each test pattern from three to 
two, and it eliminates the necessity of time-consuming color 
coordinate conversions. 

As previously mentioned, the automatic print quality tester 
ensures the quality of the print cartridges leaving the pro- 
duction line. Special test patterns were developed to reveal 
the several lypes of possible prim cartridge defects. These 
lest patients help ensure Ihat the machine vision engine can 
provide an effectiv e screen for defective print cartridges. 
These patterns are printed by the print cartridge under test, 
and after completion of the printing phase. I he paper is ad- 
vanced (Fig. (i) until the lest patients are Within the fields of 




Fig. 5. A view Of III" automatic print quality K'stel Miuvviug tin- 

illuminators, cameras, optics, rotary table, and print cartridge 
nests. 
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Fig. 6. A view of I ho automatic print quality tester Showing the 

paper (feed and take-up risels and tension arms. 

view of the cameras. Signals sent between the paper motion 
controller and the machine vision engine synchronize the 
acquisition of all of the bes) pattern images. Tlie machine 
vision engine then processes and analyzes the images using 
techniques such as row and column profiling, edge enhance- 
ment, and grey-level pixel sums to identify any defects pres- 
ent. Al the completion of t he image analysis phase, fail 
codes that represent any lest pattern defects that may have 
been detected are sent lo the HP-UX*-baSed system control- 
ler via an RS-232-C serial data link. Because of t he physical 
separation between the cameras, and for efficiency and 
throughput considerations, during any given test phase the 
machine vision engine is evaluating the test patterns from 
two different print cartridges. It is the responsibility of the 
system controller in receive and sort the fail codes for these 
print cartridges properly so that all of the failure informa- 
tion for a particular print can ridge is associated with the 
bar-code number for I hat print cartridge. This failure data is 
then both stored in a local file and sent to the production 



line data collection system via a local area network. The 
automatic print quality tester lias demonstrated that a fully 
automatic machine is capable of effective print quality as- 
sessment, and that it can ensure that the print cartridges 
leaving the production line are of the highest possible quality. 

Conclusion 

The high-volume production of thermal Inkjet print car- 
tridges requires many very accurately controlled and repeat- 
able processes. Most of these processes are implemented 
using well-designed, high-precision automatic machines. 
Machine vision has played an essential role in the calibra- 
tion, inspection, and control aspects of many of these pro- 
cesses, and it has helped ensure interprocess quality as well 
as the quality of the final product. Machine vision is a key 
ingredient in the further evolution of computer integrated 
manufacturing. As the technology Continues to advance and 
the cost of its use continues to decrease, an ever-expanding 
number of applications will be found for machine vision. 
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HP DeskWriter C Printer Driver 
Development 



Running on the host computer, the driver provides all of the intelligent 
formatting, rasterizing, color matching, and dithering for this affordable 
black and color printer. 

by William J. Allen. Toni D. Courville, and Steven O. Miller 



A printer driver is a program thai provides an interface 
between an application program and a printer. In the Mac- 
intosh and Microsoft Windows environments, the applica- 
tion/driver interface is well-defined. This allows a single 
driver to serve all applications in a particular environment. 

The HP DeskWriter C and DeskJet 500G printers use the 
same print cartridges and mechanical components to mark 
the page. Basic print modes and color imaging techniques 
are the same for both products. To improve clarity, this ar- 
ticle focuses on the DeskWriter (' (Macintosh I driver. Where 
appropriate, significant differences in the DeskJet 500C 
(Microsoft Windows) driver will be pointed out. 

To be competitive in the marketplace, a low-cost printer 
manufacturer must provide drivers for the two most popular 
windowing environments, Microsoft Windows and the Mac- 
intosh operating system, Manufacturers of high-end printers 
can always include the widely used page description language 
PostScript to guarantee support of the printer However, 
including PostScript in a low-cost color printer like the IIP 
DeskWriter C WOUld he prohibitive, significantly increasing 
the price of the product. 



The alternative is to build into the printer only the logic nec- 
essary to put the dots onto the paper fast enough to keep 
the mechanism busy. This requires the driver, running on the 
host machine, to provide all of the intelligent formatting, 
rasteri/.ing (convening logical graphics objects to a bit 
image), color matching, and halftoning. This approach, of 
using the host machine's CPU power to create the raster 
image to be laid down by the printer, is the one that we lake 
with the MP DeskWriter C. 

HP DeskWriter C Printer Driver 

The IIP DeskWriter G driver is a program that sits between 
the user, the application, and the operating system as shown 
in Pig. I. This diagram shows the major paths of commu- 
nication and control managed by the driver. 

The user creates a document using the application, then 
chooses Page Setup, causing the application to call the printer 
driver's Page Setup command. The printer driver puis up the 
Page Setup dialog box for the user and returns the modified 
page si/.e and printer attributes to the application. The 
application can use this information to reformat the 
document for the new attributes. 
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The user then prints the document l).v selecting Print, causing 
the application to call the printer driver's Print command The 
printer driver puts up the Print dialog box for the user, and 
when the user is finished making selections the printer driver 
allows the application to send it a series of page descriptions. 
These page descriptions are a sequence of imaging com- 
mands which describe the page as a' series of objects such 
as text, lines, circles, and raster images (PixMaps). The 
imaging commands are in QuickDraw, the drawing conunand 
language of the Macintosh (similar to (J[)l. or Graphics 
Device Interlace, in Microsoft Windows). 

Now the printer driver will immediately return control of 
the computer back to the user if the user has enabled back- 
ground printing, or it will continue processing the print job 
in the foreground. In any case, (he printer driver opens a 
communication path to the printer (either serial or Apple- 
Talk), determines which print cartridge is installed in the 
printer, and then proceeds to use a combination of Quick- 
Draw and its own built-in functions to rasterize the page 
description into a 150-dpi or:J00-dpi PixMap. 

This PixMap is then adjusted to compensate for the differ- 
ences between the display and MP printing technology. 
This may include color matching for the current media, 
which attempts to make the printed colors appear the same 
as I he colors on the monitor. 

The PixMap is finally halftoned, which involves using vari- 
ous patterns of printed dots to simulate all colors that can 
be produced on the monitor. For instance, since the printer 
only has cyan, magenta, and yellow inks, it can't directly 
produce a purple dot. Purple is produced by printing a mix- 
lure of red and blue dots in a checkerboard pattern. Each 
red dot is made by printing a yellow dot on top of a magenta 
dot and each blue dot is made by printing a magenta dot on 
top of a cyan dot . This halftoned data is then compressed 
and transmitted to the printer. 

While a communication path is open to the printer, the driver 
continuously receives status information from the printer so 
it can report error conditions such as "out of paper" or 
"wrong print cartridge installed" (e.g.. the color print car- 
tridge is installed, but the document only contains black 
data). 

Rasterization 

Fig. 2 gives a closer look at how the printer driv er rasterizes 
a page. The transformation from page description to PCL 
data takes place in several steps. The rasterizing module 
uses QuickDraw to do much of the work since the page 
description is in a set of QuickDraw commands anyw ay, but 
it replaces some of QuickDraw's functionality where neces- 
sary, such as in the rendering of text. Before Apple's System 
7.0. QuickDraw did not provide scalable outline fonts, so the 
I IP DeskWriter C rasterizer contains its own outline font 
renderer called Intollifont™ from AGFA Coiupugraphic. 
This provides all Macintosh users with access In high- 
quality scalable outline fonts for the most commonly 
used typefaces. 

Because most computers have limited RAM, and rasterizing 
a ;300-dpi page can take a lot of memory ( up to 32 megabytes 
to represent a full color page), the driver usually rasterizes a 
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Fig. 2. This is the process fur rasterizing a page in the HP Desk- 
Writer C printer driver. The rectangles denote data and the ovals 
denote code. 

portion of the page at a time in what is called banding. Band- 
ing is pictured in Fig. S. The less available RAM the comput- 
er has. the more bands it takes to rasterize the whole page, 
and the longer it takes to complete the whole page, because 
there is a fixed ov erhead in rasterizing each band. Each ob- 
ject must be examined to determine if it falls within the 
band. If it does then it is rasterized (drawn) into the band. 
Otherwise, it is ignored (clipped). Objects that cross band 
boundaries must be rasterized for each band thai they touch. 
For most common documents, banding adds little or nothing 
to the print time. Only for very complex documents that 
have a large number of objects, or for very low-memory con- 
ditions. which increase the number of bands, does banding 
significantly affect the time to rasterize the whole page. 
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Fig. 3. S Ml liie host computer does not liave enough available memory to rasterize a complete page, banding is used, (a) This jwge repre- 
sents a 250-color document (8 bits per pixel) To rasterize it at 300 dpi would take about 8M bytes of memory (b) If only 2M bytes of 
memory are available for rastprizing. then only a quarter of the page can be rasterized D a Ume. (c ) The solution is to use the 2M bytes of 
memory ,0 rasterize ihe lop quarter of the page As soon as this band is rasterized and its colors are adjusled. it is dithered, compressed, 
arid transmitted 10 the printer WtaBe tlus data is being transmitted, rasterization of the second quarter of the page begins, using the same 
2M bytes of memory fO buffers ensure that printing continues while successive bands axe being rasterized. 



User Interface 

This product had a wide range of human factors issues and 
challenges, all of which were analyzed, designed, and tested. 
The issues included defining a concept ual use model of the 
product, hardware design, and integrating aspects of color 
imaging into a simple driv er interface. 

To design a produci thai meets the requircmcnls of our users, 
we needed lo find out Who our users are. The key daia point 
returned from market research was that the HP DeskWriter 
C should be designed for home and business users who do 
not currently use color, and who do not want to struggle with 
their computer or printer. Initial market research and human 
factors concept testing pointed strongly to the necessity for 
a user-friendly produci with good print quality. 

Usability Studies and iterations of the user interface fol- 
lowed and continued for the nexl six months. Usability stud- 
ies are more detailed than concept testing, and are designed 
to test users' reactions to specific aspects of a prototyped 
product To test the user interface, subjects were asked to 
perform numerous printing tasks in a number of different 
scenarios. The users were questioned regarding verbiage, 
ease of use, and functionality offered. ( Hir goal was to de- 
sign a usable driver wiihoul compromising functionality and 
without creating confusion for less-technical users. We had 
three formal usability studies during Ihe produci design 
phase, which required a number of iterations to the driver's 
interface. The results from each study were published by 
our human factors engineer along with design recommenda- 
tions for the next iteration. These design recommendations 
were based on the users' responses to questions, task 
completion successes or failures, and other observed data. 

Dialog Boxes 

Fig. 4 shows the final dialog boxes of the user interface. The 
first two dialog boxes, Print and Page Setup, are accessed from 
the menu bar. A key design objective was to make these 
screens as simple and uncluttered as possible. It was de- 
cided that only those functions for which frequent change 
was required would be in the two main dialog boxes. The 
more technical functions were put into the Colors and Options 
screens. This was done to decrease confusion for the typical 
user who would rarely have use for the added features, The 
defaults were chosen such thai most users would not require 
the Colors and Options dialog boxes. 



Features acc essed through the Print dialog box are print qual- 
ity, copies, page range, page order, and print method. Most 
of these printing features are common to other Apple driv ers. 
Print Method Use installed print cartridge only is a feature designed 
specifically for this product. When this checkbox is on. the 
alert messages prompting the user to exchange the print 
cartridge, which normally display when there is a discrepancy 
between the document content and the installed print car- 
tridge, will not appear. This funclion is explained in further 
detail later in this section. 



Deskulntei t Pi ml 



OK 



Quality: 
OBest 
® Normal 
O Draft 

Copies: 

0 



Pages: 
® fill 

0 From: |J To: Q 

Page Order: 

□ Print Back lo Fronl I 
Print Method: 

□ Use Installed print caitridge only 



Cancel 



iPreoterul 



Help 



OeskU'iitpi I Page Setup 



Media: 

®US letter 

0 US l egal 
~ US transparency 
Paper type: 

□ HP Special Paper 
Orientation: 



OK 



Ofl-1 tetter 

O M Tr ansparency 

0 Enrielope 
Reduce Enlarge: 

1 tool % 



Cancel 



I Options 



Colors 



□ Soue as Default 



Help 



Desktllrlter t Options 



UK 



Printing Preleiences: 
H Fonl Substitution 
HUse HP Fonts 

□ Precision Bitmaps 

□ Stop Helutpeii lr onspniem les 



t lean Pnnl Cartridge: 
[ Start ] 



Canrel 
Help ) 



OeskUiiiler C Colors 



OK 



Color Blending: 
•) Pattern 
O Scatter 

fmed 8 Colors 

□ CompleH Color Pooling 



(am el 



Help 



Intensity 



Colortest 



Fig. 4. Ill" I IcNkW'riter ( printer dialog boxes 
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The Page Setup dialog box features arc consistent with other 
Apple primers in that they allow the user to choose media 
type, print orientation, and a scaling value. The distinct fea- 
tures offered by the MP DeskWriter C are the HP Special Paper 
and the Save as Default checkboxes. The HP Special Paper check- 
box is selected when printing on special paper. This causes 
the driver to adjust the way ink is put on the page so it is 
optimized for HP special paper. Save as Default offers the user 
the ability to save ilem stales permanently in the Page Setup. 
Options, and Colors dialog boxes. For example, a user who 
always prints on U.S. legal-size paper need only select the 
paper size once, then check the Save as Default box. and click 
OK to exit the dialog box. US Legal will then be the new de- 
fault, instead of the factory default. US Letter. This saves the 
user time by eliminating Ihe need to enter the Page Setup 
dialog box for each document. 

The Options dialog box, accessed by clicking Ihe Options bullon 
in the Page Setup box. contains printing preferences and a 
Clean Print Cartridge button. This button, when selected, at- 
tempts to restore print quality if print becomes faint or 
dots are missing. This is done by activating a print cartridge 
pruning algorithm in the printer. 

I'sers who waul more color capability can find it by clicking 
Ihe Colors button. The color blending selections determine 
how dots of the three colors (cyan, magenta, yellow) are ar- 
ranged on the page to create blended colors. Each of these 
selections produces slightly different results. Pattern produces 
faster output than Scatter and is recommended for simple 
solid-color graphics. Scatter lakes longer to print and should 
be used for sophisticated color graphics, such as scanned 
images and photographs. Fixed 8 allows applications that sup- 
port color on Ihe Macintosh Plus, SE, Portable, and Classic 
Computers to print the basic eight colors (red, blue, green, 
cyan, magenta, yellow, black and white). FixedR is the only 
color blending setting available for these machines. 

When the Complex Color Printing checkbox is on. Ihe driver ad- 
justs the printed colors to provide the best match in appear- 
ance to Ihe screen. This capability is recommended for com- 
plex color graphics such as scanned images, photographs, 
and complex computer-generated artwork. 

The Intensity slider allows the user to select the amount of 
color ink that is printed on the page. More ink increases the 
intensity of the images. Users in high-lumiidily environments 
may need to move the slider to the right to decrease the 
amount of ink on the page, since ink bleeding can occur 
with increased moisture in the air and on the paper. To as- 
sist the user in deciding which slider bar location creates 
the most desirable output, a color lest was created. When 
the user clicks Ihe ColorTest button, a one-page printout is 
provided dial shows the effects of the intensity settings on 
(ext. simple graphics, and a complex color image. The Color- 
Test feature is designed to save the user time by eliminating 
the need for numerous experimental printouts to determine 
the best intensity setting. 

The Colors dialog box was created for the more sophisticated 
color user. The defaults were chosen such that output will 



be acceptable for the majority of print jobs. Pattern was cho- 
sen for its speed over Scatter and the assumption thai most 
users would probably print simple color graphics, The 
middle selling In the Intensity slider bar is Ihe default . ii is 
desigtied lo work best in most environments. 

Print Cartridge Selection 

The dialog boxes described above illustrate Ihe user-driver 
interaction required to control the printer. In addition, it is 
necessary for the user lo interact with the printer hardware 
by changing print cartridges. Because (his is a completely 
new task to most customers, numerous prototypes were 
designed and lested before the final solution was created. 

Fig. 5 shows one of Ihe print cartridge swap/page setup pro- 
totypes that we explored. The three extra buttons labeled 
Color. Black/GreyScale and Auto Select were provided lo give Ihe 
user a choice of which cartridge lo print with. If Ihe user 
chose Black/GreyScale. Ihe black print cartridge was expected 
lo be in the printer, and the document would be printed in 
black. If Color was selected, the color print cartridge would 
be used. If the expected print cartridge was not in the printer, 
an alert message would appear prompting the user to insert 
the correct cartridge. If Auto Select was chosen, the document 
was scanned, and Ihe user was prompted to insert the black 
print cartridge if Only black was present or lo insert the color 
print cartridge if color was present. With a mullipage mixed 
document (black and color pages), the pages would be or- 
dered so thai all the black pages were primed first, followed 
by the color pages, eliminating ihe need lo swap print car- 
tridges more than once. The user was always given the 
choice of ov erriding the swap alerts and continuing to print 
with Ihe current print cartridge. 

Tests of litis model produced both positive and negative 
results. On the positive side, users liked the control of se- 
lecting the cartridge type themselves, without getting swap 
alerts. Choosing Black/GreyScale lo print a draft of a color doc- 
ument is one example of the control desired. On the negative 
side, this model required the users to go into the Page Setup 
dialog box before printing, which was often forgotten. 
Mosl subjects felt this extra step cumbersome and not 
"Macinlosh-like." 
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Fig. 5. A prototype HP DeskwriterC dialog box that was rede- 
signed bemuse usei n ac imns during usability testing were mixed 
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The final print cartridge swap/page setup model eliminates 
print cartridge choices from the Page Setup dialog box com- 
pletely. Instead, the machine scans for color and display's an 
alert message when a mismatch between the cartridge and 
the document is found. With this model, the user need not 
go into the Page Setup dialog box at all. The final model is 
both easy to follow and gives users the control they want. 
Instead of having the user select which print cartridge to 
use, the driver is always in auto select mode. The driver 
scans the document for color. If none is found, it verifies 
installation of the black print cartridge in the printer. If the 
color cartridge is in the printer, the user is alerted to change 
the print cartridge. The same is done if a color document is 
being printed and the black cartridge is installed. If the user 
is printing a document with both black and color pages, the 
driver firsts prints the pages that can be printed by the 
installed print cartridge. When printing of those pages is 
complete, the driver alerts the user to swap the print car- 
tridge, then continues printing the remaining pages. This 
minimizes the number of user interactions. The control pro- 
vided by the former design is offered with this model by (he 
Use current print cartridge only checkbox in the Print dialog box. 
When this box is checked, the driver will print all pages 
using the current print cartridge in the printer, and will not 
prompt the user for print cartridge swaps. 

Print Modes 

In general, print quality settings allow the user to trade 
print quality for print speed. Draft mode is fastest, and has 
the additional benefit of saving ink. Mode selection affects 
the resolution at which the page is rasterized and the timing 
and placement of the dots that are printed onto the page. 
The printer's firmware is responsible for managing the 
mechanism, prinlhcad. and low-level dot control. 

imaging Resolution. Most of the time, HP DeskWriterC drivers 
operate at 300 dpi. For faster throughput at the expense of 
resolution, 150-dpi imaging is available as draft mode. Ka.s- 
terizing at 150 dpi versus :100 dpi speeds up printing in several 
ways. First, the driver only has in draw objects at half the 
resolution. Second, at ISO dpi. ii takes ortlj one fourth as 
many bands to rastcrize a page as ii dues at .'100 dpi, because 
each band at 150 dpi represents four limes as much page 
area as it does at 300 dpi ( memory requirements grow as the 
square of the resolution). Third, the driver has one quarter 
the amounl of data lo halftone, compress, and transmit. 

Dot Timing and Placement. The print earn nine contains three 
colors of ink: cyan, magenta, and yellow. For each primary 
color, sixteen nozzles are allocated. The nozzles for each 
primary are separated vertically (Fig. (i). This means cyan 
ink is always fired onto dry media. Magenta comes next, 
possibly overprinting cyan, and yellow follows. 

In general, the printer can lay lf> rows of each color down al 
a time and then advance the paper 16/300 inch to lay down 
the next Hi rows of color. However, there are limes when it 
is advantageous to lay I he ink down more slowl.v while dis- 
tributing the ink in a single raster row among several differ- 
ent nozzles in the prinlhcad. This process is known as shin- 
gling (see Fig. 7). I'sing shingling, several separate prinl 
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Fig. 6. HP Desk Writer ("/DeskJet 500G printer orifice plate viewed 
from the paper 

passes are used to lay the ink down, printing only a fraction 
of the dots in each raster row during each prinl pass. The 
paper is advanced Slightly and I hen more of the pixels repre- 
senting that row are laid down. The printer supports both 
50% shingling and 25% shingling. 6096 shingling lays down 
every other pixel in the row on the first pass and then the 
remaining pixels in the row on the second pass using a dif- 
ferent nozzle of Ihe print head. 50% shingling takes twice as 
many print passes to print a page and about 50% more time. 
It doesn't take twice as long, since every other dol is skipped 
during each pass, allowing the printer lo increase I lie car- 
riage speed without exceeding Ihe maximum nozzle refire 
rate. Fig. 7 illustrates how the cyan ink will be printed with 
50% shingling. 

25% shingling is also available. II uses four limes as many 
passes, laying down 25% of the ink in each easier row at a 
lime. 

Shingling provides several advantages. Criming each raslcr 
in several passes gives the ink a chance to dry before the 
adjacent dots are laid down onto the page. This is especially 
iinporiani when inks of I wo differenl colors are placed nexl 
lo each other. Shingling allows the lirsl color to partially dry 
and minimizes bleed ( mixing) between Ihe I wo colors. This 
is ver> important for printing on transparency media. Laying 
Ihe ink down loo quickly causes il lo bleed anil form 
puddles. 

Shingling also distributes ihe priming of any single raster 
between two or more nozzles. This is useful for hiding in- 
consistencies belween nozzles, such as a weak or missing 
nozzle, since il distributes Ihe missing or weak dols among 
several rasters. A weak nozzle is not very noticeable when 
25% shingling is used. Shingling also hides errors in paper 
feed accuracy in a similar manner. 

When draft mechanical quality is selected, Ihe printer will 
skip every other dot. Tins not only halves the amounl of ink 
used lo prinl Ihe image, but also allows Ihe printer to In- 
crease the carriage speed for Ihe same reason il can when 

shingling, 
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Fig. 7. Shin g li ng is the process of laying down the ink dots In sev- 
eral [Kisses, printing only ■ fraction Of the dots in each faster row 
'luring each pass, (a) Tile 10 cyan nozzles spaced vertically 1/300 
inch apart anil a character to lie printed. 0>J During the first pass 
of the printhead. the first 8 cyan nozzles print half Of (he lop part 
of the character, (e ) During the second pass the first 8 cyan 
nozzles print hair of the bottom pari of the character while the 
second 8 cyan nozzles print the remaining dots in the top part of 
the character, (d) During I he third pass of the printhead the 
second 8 cyan nozzles print the remaining dots in the bottom half 
of the character. 



Resolution, shingling mode, and mechanical quality are 
mixed in various combinations to prov ide the user with three 
different quality modes for each of three lypes of media: 
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Media. Ill' DeskWriter C and DeskJet ".00 f' printers are de- 
signed to provide good print quality on plain paper, special 
paper, and transparency media. The special paper and trans- 
parency media are specifically designed to work in the print- 
ers. They are available from Hewlett-Packard as ( X Jet- 
Series C'uiSheet Paper and LX Jet Scries Transparency Film. 

The ink performs differently on the three supported media 
types. When the user informs the driver which type of media 
has been selected, the driver lakes this into account, and 
makes appropriate adjustments in the amounts of each ink 
deposited on Ihe media. 

Delivering high print quality on a wide range of plain papers 
is very difficult. As previously mentioned, print quality var- 
ies as a function Of paper type and environmental condi- 
tions. The black print cartridge is the same one used by the 
IIP DeskJet and DeskWriler printers. It is optimized to per- 
form well over a wide range of plain papers. Similarly, lite 
color print cartridge is designed to perform well on plain 
papers as well as on special paper. During development, 
print quality was tested on many different types of plain 
paper. From this large group, a small set of papers, each 
representing a significant subset of Ihe universe of papers, 
was selected. Creating this manageable set of papers for 
initial evaluation of various inks, print cartridge architec- 
tures, and print modes proved valuable. Promising combina- 
tions could then be tested against the larger set to ensure 
robust performance. 

Because we can control the design of the special paper, it is 
adjusted lo accommodate the print cartridge. The special 
paper is coated on one side. The coating causes most of the 
colorants in Ihe inks lo be deposited near the surface of the 
paper after the inks dry. Concentrating the colorant near the 
surface increases the saturation of colors. The coating also 
reduces the sensitivity to environmental conditions. When 
using special paper. Ihe user is less likely to need to adjust 
the intensity slider to accommodate environmental 
extremes. 

The transparency medium consists of ;i plastic substrate and 
a special coating. The coating accepts the ink. Without the 
coaling, ink would puddle up on the substrate and run to- 
gether, forming large muddy pools. Like Ihe special paper, 
the transparency is designed around Ihe print cartridge. 
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Color Halftoning. Applications communicate with the driver 
through a stream of QuickDraw commands. These com- 
mands specify 24 bits of color for each 300-dpi pixel on the 
page. The printer can only place three bits of information at 
each 300-dpi print grid position — one bit for each primary 
ink. Color halftoning is the process that reduces the in- 
formation from 24 to 3 bits per print position. By carefully 
controlling the placement of colored dots, myriad different 
colors are produced. Cyan, magenta, and yellow are lialf- 
toned independently. Cyan is reduced from 8 bits per pixel 
to 1 bit per pixel. Similar reductions occur for magenta and 
yellow. 

Halftoning increases the effective color depth of the printer. 
The mechanism can put one of eight different combinations 
of the primary inks (cyan, magenta, yellow, red. green, blue, 
black, or white ) on the paper at each 300-dpi print position. 
Willi halftoning, many more than eight colors can be pro- 
duced. The increased color depth comes at the expense of 
spatial resolution. The halftoning techniques used in the HP 
DeskU liter G driver preserve 300-dpi edges. Spatially, the 
human visual system is much more sensitive to the edges of 
an object ihan to color shifts within the the object. Because 
the edges are preserved, Hie reduction in spatial color reso- 
lution is not offensive, and the increase in color depth allows 
complex images, such as photographs, to be reproduced 
well by the printer. 

The user can choose between two color blending (halfton- 
ing) algorithms: patten and scalier. Pattern is a dispersed 
ordered dither ( Bayer's), while scatter uses a form of error 
diffusion. 1 

Pattern. The pattern algorithm is the default halftoning algo- 
rithm. It is a form of ordered dither. It does a good job with 
almost all types of data, bul is best suited to simple graphics 
composed of large homogeneous regions. Computationally, 
it is much simpler than error diffusion. An 8 x 8 threshold 
matrix defines the halftoning pattern. Many different pat- 
terns could be used; the best pattern depends on the type of 
image being printed and on personal taste. To keep the user 
interface simple, only one pattern is offered: Bayer's dither. 
This pattern is very good at preserving fine detail. Preserva- 
tion of detail is especially important when halftoning text 
that is not being printed at lull intensity. 

There are 64 cells in the 8x8 threshold matrix. Each cell 
contains an 8-bit threshold. The page is logically tiled with 
the matrix. Each pixel's 8-bit Value is compared with its cor- 
responding position in the matrix. If the pixel's value is 
greater than the threshold, a dot is fired at thai position; 
Otherwise, no dot is fired. This reduces the 8-bit informal ion 
for the pixel to 1 bit f fire or don 'I )• The process is repealed 
for all pixels on the page. 

In an 8 x 8 area on the page, the number of dots fired can be 
anywhere from zero to 04. This means a total of Of) (counting 
none) different amounts of any primary color can be used to 
fill an area. The original data is 8-bit. representing 2- r )(i levels. 
Because only 66 patterns are used to represent 256 levels, 
visible contouring may occur when the specified amount of 
ink gradually varies over a large region of the page. 



An example 4x4 threshold matrix is shown below. In this 
case, the pixel \alues would be scaled between 0 and 16 
before halftoning. If a pixel's value were greater than the 
corresponding threshold matrix entry, that dot would be 
fired, otherwise it would not be fired. 

4 • 4 Threshold Matrix 
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For example, assume a large homogeneous region were being 
halftoned. and the scaled values in the region were all 8. The 
resulting pattern of dots fired would be a checkerboard. 
This is because every other element in the threshold matrix 
is less than 8. 

An obvious feature of the ordered dither is the geometric 
artifact that is \isible when homogeneous regions are half- 
toned. At 300 dpi. the artifact is small and usually not objec- 
tionable. A shortcoming is the inability to represent 256 sep- 
arate levels. This means a continuously varying gradient will 
be broken up into bands (or Contours) of pixels that all map 
onto one of the 65 levels- 
Scatter. For the scatter halftoning algorithm, a version of the 
Floyd-Steinberg error diffusion algorithm 1 is used. Unlike 
ordered dither, error diffusion is not restricted to 65 patterns. 
A pixel is examined, and if its 8-bit value is greater than 12S, 
a dot is fired. The difference between the specified value 
and 255 is the error that is produced by putting down a full 
drop of ink. This error is diffused among four neighboring 
pixels, reducing their specified values slightly (see Fig. 8). If 
the original pixel's 8-bit value is less than 128, no ink is fired, 
and the error is simply the pixel's value. In this case, the 
distributed error increases the values of the neighboring 
pixels. 

Pixels are processed from from left to right along each raster. 
Rasters are processed from top to bottom. The error from 
each pixel is broken up into lour pails, which are distributed 

o o o 



O Et*-0 




Fig. 8. Bitot diffusion to neighboring pixels in the scatter halftoning 
algorithm, 
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lo neighboring pixels. Some noise is added lo I he error 
terms to tweak lip artifacts thai lend lo appear in error dif- 
fused data: The average value of Ihe noise is 0, so the image 
is not lightened or darkened. 

Error diffusion does noi suffer from contouring. Color gradi- 
ents are primed as smoothly varying regions of inereasing 
dot density, 

Error diffusion can produce artifacts. These are most likely 
to be visible in large homogeneous regions. Il is best suited 
for complex images like photographs and sophisticated pre- 
sentation graphics. Hence, il makes a good compliment lo 
the ordered dither algorithm. Error diffusion requires more 
computation than patient dither, and can degrade through- 
put. I'atlem dither was chosen as Ihe default halftoning al- 
gorithm because il is faster, and is best suited for common 
types of output 

Grey Balancing 

In theory, an equal mixture of cyan, magenta, and yellow 
inks would produce a neutral (white, grey, or black) color. 
In practice, litis does not occur. The particular inks used by 
Ihe HI' DeskWriter C and DeskJet 5001' printers, when mixed 
in equal proportions, typically produce a color with a slight 
greenish cast. To compensate for this effect, grey balancing 
is performed before halftoning. 

Grey balancing reduces the amount of cyan used for neutral 
and near-neuiral colors. Hecrcasiiig cyan increases Ihe rela- 
tive amounts of magenta and yellow. Magenta and yellow 
inks together make red. which is the opponent color of green. 
One can think of Ihe cyan redaction as an increase in red, 
which compensates for the greenish cast. Unfortunately, 
reducing Ihe amount of ink on Ihe page makes the color 
lighter, so the cyan reduction must be balanced against 
the loss in darkness. Experimental ion showed that a cyan: 
magenta:yellow ratio of 2::!:'i produces good neutral and 
black colors over a wide range of papers. 

The adjustment is made by first computing the salutation 
of the color.- The following equation j ields a value for s 
between I) and I : 

_ (max(c, m, y) - min(c, m,y)) 
maxlc m.y). 

In litis equation, c, m. and y represent Ihe amounts of cyan, 
magenta, and yellow inks, respectively. Larger values of S 
indic ate more saturated colors, while 0 indicates a neutral 
color. 

The amount Of cyan ink is adjusted based on S. If S is 0, 
cyan ink is reduced to two thirds of ils original value. If S is 
1, cyan is Unaffected. Cyan ink is adjusted linearly between 
100% and tiT' ■■> for intermediate values of S. 

Intensity Slider 

Possibly the greatest challenge in creating a plain paper color 
printer is delivering high-quality output Over a range of envi- 
ronmental conditions (temperature anil humidity] and on a 
range of media, from copier paper to high-quality col Ion 



bond. Two problems occur when fully sanitated colors are 
printed on some types of paper, oral high humidities. Color 
bleed occurs when adjacent colors run into each oilier and 
mix. Ink can even bleed through the paper and appear on 
the back side. This can occur even if shingling is used. Bronx 
big occurs when too much ink is laid down on the paper and 
the dye sits on top of Ihe paper fibers rather than soaking in. 
This overabundance of ink cryslali/.es on Ihe surface and the 
crystals reflect light. This causes a shiny reflective surface 
that actually gets lighlcr as more ink is laid down. 

As it turns out, when these problems arise, reducing the 
amount of ink used in printing the colors usually results in 
higher-quality output. I ~sc too much ink and you gel the 
problems described above: use too little, and the output 
looks dull and washed out. 

Other factors affect the user's perception of quality and col- 
or accuracy, such as room lighting, computer display varia- 
tions, personal taste, and Ihe type of data being printed such 
as line art versus an image. Rather than allempl to charac- 
terize all of these factors, we supply a control that is analo- 
gous to the brightness control on a television set. This is 
called the Intensity slider. 

The intensity slider is a live-position slider available under 
the Options dialog box described above. It allows the user lo 
control the intensity (saturation) of the colors on the page. 
This is done by controlling how much ink is used to create 
any particular color, which affects how saturated Ihe color 
appeals as well as other print quality factors described pre- 
viously. At Ihe slider's lowest setting, il can reduce the 
amount of ink used lo generate a color by as much as 7096. 
This reduction is applied to Ihe 24-bit data in the raslerized 
band just before halftoning. At Ihe slider's highest selling, no 
reduction in ink volume occurs. The degree lo which ink is 
reduced is a nonlinear function dependent on how much ink 
was specified in the first place. The percentage reduction is 
larger for more saturated colors, because these colors are 
most affected by problems associated with media type and 

environmental conditions. 

Edge Enhancement 

One unfortunate side effect of the intensity slider is thai 
colors that would normally be a solid area fill of ink are now 
created with a dithered pattern of dots. This is even true for 
black printed with Ihe color cartidge. At the minimum set- 
ting; about one third of the maximum posssible volume of 
ink is used. This may produce Ihe best-quality black for area 
fills, but the edges of black characters will be rough in 
appearance and may have color halos. 

The driver implements a simple edge enhancement algo- 
rithm. If a pixel is black and one or more of ils four nearest 
neighbors (north, south, east, or west ) are while, il is con- 
sidered an edge pixel. Edge pixels are always printed with 
one drop each of cyan, magenta, and yellow. They are never 
depleted. 

Fig. 0 shows how edges arc enhanced. 
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Fig. 9. Edge enhancement in Che HI' DeskWriterC printer, fa) The 
edge of s black character with no saturation reduction or color 
correction, no The edge of a black charactei with color correction 

anil a low selling of l lie intensity slider. The cenler <tf lite charac- 
ter will appear black, lull the edge is l ough hecait.se of missing 
pixels Also. In -cause unequal amounts of cyan, magenta, anil yel- 
low arc used to produce hlaek. Hie edge has a reddish line or halo 
because of ihe absence of cyan dots. If this were the lower righl 
portion of 8 character, the rightmost pixel or the seril would be 
totally missing. (<£) VWth edge enhancement, every pixel thai is 

ailiarent tO 1 while pixel is printed Willi Cyan, in.igenla. and yellow 
dots. This creates a smooth edge without unwanted hues. 



Selec tion of Printed C olors 

The color halftoning techniques previously described allow 
the printer to produce myriad colors. The task of determin- 
ing which colors to print seems simple, but turns oul lo be 
complex. Al first glance, one might suggest putting colors on 
paper that exactly match the colors on the monitor. In prac- 
tice, this produces surprising results. The situation is further 
complicated by the type of image printed Processing a 
scanned image of a person's fac e and a bar chart in the same 
manner may not be a good idea. 

The goal is to not surprise users when they receive the out- 
put. To this end. the HP DeskWriler G driver allows the user 
to choose between two color selection paradigms. The de- 
fault is optimized for simple business graphics, but usually 
gives acceptable results with any type of output. The Complex 
Color Printing checkbox biases color selection for scanned 
photographs, sophisticated presentation graphics, and other 
complex images. 

Chroma-Based Selection (Default). The default color selection 
model places a high priority on the chroma (colorfulness 
judged relative to a neutral area of similar lightness) of col- 
ors. On CRTs, chroma increases when the beam intensity of 
one or two of the electron guns is increased. This also pro- 
duces a lighter color. On paper, chroma is increased by pul- 
ling down more of one or two primary inks; this darkens the 
color. In the chroma-based selection scheme, priority is giv- 
en to chroma over lightness. When printing simple business 
graphics, such as bar and pie charts, customers usually desire 
solid, high-chroma colors. The fact that printed colors appear 
significantly darker than monitor colors is not objectionable; 
it is usually desirable. 

Colors are passed to the driver in RGB format. Red modu- 
lates the amount of cyan ink, green modulates magenta ink. 
and blue controls yellow. More red on the screen means 
less cyan on the paper. The cyan, magenta, and yellow inks 
can be thought of as "negative red." "negative blue," and 
"negative green," respectively. 

A one-dimensional correction function is Independently ap- 
plied to each primary before it is liall'toned. This function 
accounts for two factors. First, the perceived intensity of the 
monitor is not linearly relaled to electron gun voltage, and 
second. I he perceived ihirkncss on Ihe paper is not linearly 
relaled lo the iuuount of ink deposited in a given area. 

When high-chroma colors are darkened by Ihe driver, they 
are not as distinguishable as their counterparts on the moni- 
tor. This is not generally a problem, but some applications 
are capable Of producing fairly complex business graphics. 
Kor example, in a three-dimensional bar chart, the lop. front, 
and side of a bar can be colored with different I ones of Ihe 
same hue. This gives the illusion of depth on the screen. If 
the three faces of ihe bar don't have the correct tones on 
paper, the sensation of depth is reduced. In this case, ap- 
pearance based color selection may give superior results 

Appearance-Based Selection. Scanned photographs and 
oilier sophisticated images can be distorted objectionably by 
chroma-based color selection. In these images, differences 
in lightness often c onvey depth information. Artificially 
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darkening high-chroma colors can make the output look 

unnatural, hp DeskWriter G users can instruct the driver to 

optimize color selection for these types of output. 

A white patch on a monitor is not the same color as white 
paper. This can easily be seen by holding a piece of paper 
next to a CRT. White viewed on the monitor will (probably) 
have a bluish cast. The bluislmess is not usually noticeable 
because the human visual system is remarkably adept at 
accommodating a wide variety of while points. Through 
green, grey, and rose colored stmglasses, snow looks white, 
grass looks green, and other objects appear as expected. 
This is the phenomenon of color constancy. The visual sys- 
tem adapts to the monitor's white. Only when another suit- 
able reference white is placed in its proximity does the 
monitor appear bluish. 

If colors on paper were selected to match the monitor ex- 
actly, monitor white would have to be printed as a pale blue 
tint. However, the eye has difficulty in accepting tinted paper 
as while. Because the HP DeskWriter t* printer is a 300-dpi 
binary printer, pale blue must be created by a sparse scatter- 
ing of relatively lar ge blue dots. Monitor white is homoge- 
neous, while the paper pale blue version would appear 
textured. 

A belter approach is to accept the fact that unmarked paper 
and the monitor appear white when considered indepen- 
dently. Colors are selected for printing so they appear, rela- 
livo to unmarked paper, the same as the monitor's colors 
appear relative to the CRT's white point. 

Gamut Issues. The gamut of a color device is the sei of all 
colors ii can reproduce. Typical monitors have a larger 
gamut than the HP DeskWriter C printer. This means they 
can display colors that Cannot be reproduced by any com- 
bination of the printer's cyan, magenta, and yellow inks. A 
device's gamut can be modeled as a three-dimensional solid. 
The shape is irregular, bul roughly resembles a lumpy foot- 
ball. The idea is to compress the monitor's gamut so I hat it 
fits inside the printer's gamut Some compromises must be 
made when choosing a gamut compression algorithm, and 
although objectionable distortion is controllable, it cannot 
be eliminated altogether. 

Characterize Monitor. To match colors on a monitor, ii must 
be known how the colors on the monitor appeal' Co the 
viewer. A spectroradiometer can be used to measure the 
colors produced by various intensities of red. green, and 
blue on a CRT. The device measures the amount of radiant 
energy emitted by the monitor as a function of visible wave- 
length. The system is well-behaved, and a reasonably simple 
model can be used to predict colors accurately once a few 
constants have been determined. 

Apple's Macintosh color monitors all use Sony Trinitron 
( RTs. The k!-inch Apple monitor was chosen as the target 



monitor for the HP DeskWriter C driv er. Other sizes of 
Trinitron CRTs are available from Apple and other vendors. 
Other brands of CRTs are also av ailable. However, the 
P)-ineh Trinitron heavily dominates our target customer's 
environment. 

Characterize Printer. The HP DeskWriter C printer was chat 
acierized by printing a sample consisting of hundreds of 
small patches of different colors. Each patch contained a 
known percentage of cyan, magenta, and yellow dots. These 
were measured with a spectrophotometer. This instrument 
measures the amount of light reflected at various wave- 
lengths across the visible spectrum. From this data perceived 
colors can be calculated. 

The HP DeskWriter C and DeskJet 500C printers are de- 
signed to be used with HP special paper, IIP transparency 
media, and a wide variety of plain papers. The specified spe- 
cial paper was used to characterize the printer. Transpar- 
ency color selection is always chroma-based, so rigorous 
color characterization was not required. 

Two sets of samples were created, one using the pattern 
halftoning technique and one using the scatter technique. 
Each set was printed on special paper and plain paper, 
resulting in four sets of samples overall. 

The choice of plain paper proved to be very challenging. 
Good prim quality is needed across a wide selection of 
papers. Patches of eight colors were measured on over sixty 
varieties of plain paper. With this information in hand, a 
single "representative" plain paper was selected. This good- 
quality 25% cotton bond was used for the plain-paper 
characterization. 

The selection of a single target plain paper Simplified analy- 
sis. The user interface is straight forward; it allows the user a 
simple three-way selection: plain paper, special paper, or 
transparency. Although print quality may not be optimized 
for any given plain paper, it is well-controlled on almost all 
of them. When color accuracy is critical, HP special paper is 
recommended. 
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An Interactive User Interface for 
Material Requirements Planning 



For planners and buyers in the manufacturing business environment, 
HP MRP Action Manager is an online, interactive tool that automates 
many of the traditional paper-intensive activities of material requirements 
planning. 

by Ah ina Y Nishimoto, William J. Gray, and Barbara J. Williams 



MP MRP Action Manager for NewWave is an enhancement 
to HP Materials Management (HP MM.) software. IIP MM 
provides an online materials management tool for produc- 
tion planners and buyers. 1 The IIP MRP Action Manager 
operates within the HP NewWave environment 2 to provide 
an easy-to-use. Interactive user interface between a personal 
computer and the HP 3000 computer. 

In a typical manufacturing environment, planners and buy- 
ers receive MRP (materials requirements planning) reports 
after running a regularly scheduled MRP program. These 
reports, called action reports, show suggested actions for 
work orders and purchase orders to meet future demand for 
Inventory. The suggested actions include changes to pull in 
and push out dates* cancellation of existing orders, and 
suggestions for new orders. 

Planners and buyers typically have to wade through moun- 
tains of paper reports to determine the suggestions they 
would like load on. Then, they make any changes on paper 

Push out indicates that an open order 's due date should be rescheduled larther in the dilute, 
and Dull m indicates mat an open order s due date should be rescheduled with an earlier due 
dale 



and manually enter these changes into an MRP package 
such as HP MM. 

HP MRP Action Manager software automates this process 
by providing the following features: 

• Dynamic selection control. Each planner or buyer reviews 
the entire action report or a subset of the action report, fil- 
tering the information displayed by selecting some combina- 
tion of MRP actions (pull in. push out, etc. ), pan numbers, 
user-defined items, product line, and vendor. This feature 
eliminates the unnecessary printing of MRP reports, result- 
ing in reduced printing costs and increased lime savings. 
Fig. 1 shows the Action Manager window for selecting MRP 
information. The selections shown in the window indicate 
that the user is selecting parts for controller 7S) that have 
the ABC code A in product line 1. Pig. 2 shows the window 
for selecting a specific action to be performed on orders. 
This window appears when the Specific Actions command is 
selected from the selection window shown in Fig. I, 

• Online review and approval of MRP actions. Planners and 
buyers can approve MRP actions with or without changes, 
cancel MRP actions, or hold MRP actions for later review. 
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Specilic Aclions lor Orders 
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Fig. 2. An HP MRP A< limi Mana^i'i' window fur .si-lrciiiig.i s|»Tifi< 

Iictiull III hi performed 'III orders 



i Online recalculation of projected inventory, Recalculation 
Of projected Inventory occurs when order quantities and 
dates change. This IfitefacthnS "whai-if analysis immediately 
shows the planner the imparl of changes on inventory levels. 
If the imparl is not desirable, the planner can resel the 
changed selections lo the Original values suggested by MRP 
An ion Manager. 

Automated order additions and changes. Order additions 
and changes arc transferred to 111' MM for automatic update 
through the batch interlace. This feature eliminates the 

need Co reenter the data manually through HP mm online 
data-entry screens, 

I tired pad or order find. I 'sing wildcard searches and 
selection windows, planners identify and quickly display 
specific parts or orders ( Fig. 3). 

( Inline order pegging. By using online pegging, the planner 
can examine Ihe upward and downward links in a bill of ma- 
terial, level by level, to End oul where the demand is coming 
from thai creates a requirement for an order, or lo see the 
impact a supply order has on lower-level components. 
Workspace comments. The planner can create comments in 
Ihe MRP action workspace reports lo annotate the decision- 
making process. 

< iptimizod system resources. IIP MRP Action Manager uses 
a cooperative elienl/server interaction between PCs and the 
HP :i()l)0 computer for optimum use of system resources. 
Ease of use. Action Manager has a graphical user interface 
thai lakes advantage of the flexible features of the IIP New- 
Wave environment and provides online help with browsing 
and searching capabilities (see Fig. I). 



Select part or type of 
order lo find. 



Type in Ihe number to 
lind or Ihe first few — 
digits of Ihe number. 



Scroll Ihe list box, then . 
select Ihe number to find. 



The first 30 numbers are 
in the list box. 



Use Forward and 
Backward to list 
nextyprevious sel 
of numbers. 



® Part 

O Existing Supply Order 
O Suggested Supply Order 
O Demand Order 



Find: 



PART-NUMBER 



N0113QC 

N0150HP 

N0150HW 

NOI50HZ 

N0152DR 

N0152EZ 

N0152FJ 

N0153BY 



1-30 of 151 
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Find 



Cancel 



Help 



Fig. 3. The Action Manager win- 
dow for quickly finding an order or 
a pari . 
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MRP Action Manager Help - MRPACT.HIP 



File Edit Bookmark Help 



Help Inde. 

For general information on how to use Microsoft Help, select Help from the menu above 

For specific information on now to use help m HP MRP Action Manager, click 
Unna H P MRP Action Manager Help 

For general information on MRP Action Manager. click Basics 

For information on a specific topic, click the Search Outran above. 

For information on menus and commands, click topics from the list below 

Menus in the MRP Action Manager Window 

Action Menu 
Settings Menu 

Menus in the MRP Action Workspace: 



Action Menu 



Fig. 4. Uir lp window in HP WW 
Action Manager, which uses the New- 
Wavi- hdp Quality 



With HP MRP Action Manager, the planner can analyze MRP 
data and execute MRP suggestions more quickly and effec- 
tively than by using the traditional printed reports and 
manually entering data into IIP MM. The process a planner 
would use with the IIP MRP Action Manager might involve: 
Selecting subsets of MRP output from the NewWave 
interface on the PC 

Analyzing suggested changes and new orders 

Taking advantage of NewWave's cut and paste functionality 

to Incorporate portions of the MRP report into memos and 

spreadsheets 

Forwarding necessary Order changes anil additions to 
IIP MM for automatic processing. 

Action Manager Architecture 

The architecture of the IIP MRP Action Manager is based on 
a client/server model. The client (a PC running NewWave I 
Communicates over the network with the server (an IIP :!()()() 
miming I lie MPK/iX operating system). This architecture is 
shown In Fig. B. 

PC Architecture. I 'penning in the NewWave environment 
allows Action Manager to provide a user will) all of the 
benefits of the NewWave user inlerface A planner or buyer 
can easily perforin the tasks necessary to complete an MRP 
job and get access to other NewWave applications such as 
NewWave mail and object storage. 

The P( ' network requirements are IIP ThinLan. IIP StarLan 
10. or IIP KlherTwisI and any Microsoft " Windows sup 
ported networking package. Because of the amount of data 
and the transmission rate, serial support is noi provided. 

Communication between the PC and the I|p:i(ll(l) is handled 
by a sel of routines provided by an IIP (iienl/sener develop- 
ers kit called IIP Cooperative Services. HP Cooperative 
Services is a package of library modules and executable 
programs that provide the PC application developer with 
the tools necessary In write P( -lo-IIP ;i()0(l client/server 



applications easily. The IIP Cooperativ e Sen ices developer's 
kit includes intrinsics for connection and session manage- 
ment, remote IIP Image database management system calls, 
remote MPE calls, and user-defined remote procedure calls. 

The IIP Cooperative Services routines were chosen because 
they required the least amount of coding by the IIP MRP 
Action Manager development team. They also provide trans- 
parent access* to the network, and mi inlerface lo the IIP 
Image database on the IIP-'IOOO. Since these routines are also 
used in other IIP products that require high-speed network 
transfers, we felt thai the performance of these routines 
would already be tuned for the IIP MOO. Nol hav ing lo code 
the nclworkiim calls saved a tremendous amount of time and 
lowered the risk of running into technical problems later. 

HP 3000 Architecture. The IIP ( imperative Services calls em- 
bedded in the IIP MRP Action Manager allow it to access IIP 
:1000 tiles and databases. Furthermore, remote procedure 
calls from the PC lo MPK/iX routines enable I he IIP : 10(H) to 

accomplish the heavy workloads ami free the Pi ' for other 

work while a retrieval lakes place. 

The Action Manager software on the IIP-IOOO uses the ap- 
pliCatiOn CUStOniizer and application monitor software sys- 

iems. :U The application oistomizer provides a method for 
customers to tailor HP MM bo indiv idual needs and the ap- 
plication monitor automates many of the day-to-day admin- 
istrative functions usually performed by an operations staff. 
These customizable technology software tools ' implement 
much of the detail work involved in opening databases and 
files, handling terminals, and updating screens. 

The IIP MRP Action Manager architecture is designed In do 
ttlOSt of the processing on Ihe HP 3000 tO lake advantage of 
the processing power of lhal machine The PC prov ides Ihe 
planner with a graphical user inlerface and online MRP 
calculations for immediate feedback on the actions taken 
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This organization became especially important when dealing 
wiih ihe data customization feature on the HP 3000. AH for- 
malling of Ihe MKP workspace is done on the BP 3000 lo 
avoid having to eode for the PC all the same cusiomizer in- 
trinsics used on the HP 3000. The MRP load program on (tie 
IIP 3000 creates layout information about the workspace so 
that file PC knows where ihe information it needs is stored, 
and when altered, how ihe planner wants the information 
displayed on the PC. Oilier Information such as significant 
literals, search ileni names, and field names are also created 
by Ihe MRP load program Tor Ihe PC lo access. 

Action Manager Tasks 

The following sequence describes Action Manager's lasks 
and data flows between Ihe PC and IIP MOO environments. 
The sequence numbers correspond to Ihe numbers in Fig. (>. 

1. An MRP batch Job is run on Ihe HP .1000. The job creates 
MRP output files, and if requested, generates MRP reports. 
The MRP output files contain MRP suggestions, ad ions, 
requirements, and part information. 

The IIP MRP Action Manager load program reads ihe 
MRP output files, sn ips ihe MRP output, places statistics 



and indexes in Ihe MRP database, and creates a separate 
report file for each conl roller identifier (planner or buyer). 

3. When the planner activates the Action Manager from the 
NewWave desktop, the sellings. Ihe controller identifier, and 
the selection criteria are defined. The creation of the Action 
Manager workspace is also initiated. 

i. The Action Manager control facility uses the MRP data- 
base to identify the pails specified in step 3, and then re- 
Irieves corresponding action report data from the action 
report files on Ihe HP 3000. 

5. The control facility formats Ihe workspace data and uses 
the NewWave object management facility 1 ' to create a new 
Action Manager workspace object. 

6. Willi the new workspace, the planner reviews, changes, 
and approves MRP suggestions. The NewWave desktop is 
used tp switch back and forth between HP MM screens and 
workspace windows so that the user has access lo all Ihe 
relevant dala for fasler decision making. 

7. The. planner initiates batch transfer of all actions from the 
Currently active workspace to HP MM on Ihe IIP 3000. The 
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planner is prompted fur the controller password to autho- 
rize the transfer, and the password is validated against the 
III' MM database 

S. The Action Manager updates 'he ac tion report file on the 
HP 3000 with the marked actions (e.g.. pull in, push out ). 

i». The Action Manager completes the transfer, then triggers 
the Action Manager update program on the IIP 3000. 

1(1, 11, 12. The Action Manager update program scans the 
action repoii file lor approved changes and additions thai 
heed processing, updates the MEF database with changes, 
then creates hatch transaction records and sends them to 
the IIP MM continuous background batch processor. 



13, 14. The background batch processor processes the hatch 
transactions, completing the IIP MM updates. 

15. The planner archives the MRP action workspace. The 
data in the workspace can still be reviewed, but changes and 
approvals can no longer be done. 

Development Environment 

For the IIP MRP Action Manager software running on the 
PC, two software development tools were used to generate 
the early prototypes, The rust tool, CASE;W by < aseWorks 
Inc., was used to create the main windows (menus and 
menu items) and to generate the Microsoft Windows code. 
The cock- generated is highly structured and well-commented, 
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whicil made it easy to determine where to insert the func- 
tionality required to make Anion Manager work. The other 
tool. WaveMaker. which is an internal development tool, was 
used to read the C ASE:VV generated code and insert the calls 
to the NewWave OMF (object management facility), API 
(application program interface), agent, and help Facilities. 2 

Willi the use of these tools we were able to create working 
prototypes very quickly and present them to our developinenl 
partners and project team members for feedback. When the 
visual and usability aspects of Action Manager were refined, 
traditional methods were employed lo complete the PC-side 
development. The software revision controller on the HP 
,'iOOO was used lo handle revision control. 

For the HP :iOO(l development environment, the application 
customizer and the application monitor mentioned earlier 
were used. These two software tools are used by application 
designers to develop generalized software systems for 
HP -MOOO systems. The application programmer uses the 
application customizer inirinsics to manipulate data, since 



crucial information about data formats and fields is not 
known until run time. (Users can modify data formats and 
fields.) 

While Ihe application customizer intrinsics make customiz- 
able applications more manageable, the parameter lists 
and code to load the information can make coding more 
difficult. A layer of routines on top of the application cus- 
tomizer inirinsics, called customizer technology intrinsics. 
are used by the programmer to make ii easier to develop the 
application code. 

Action Manager developers also took advantage of reus- 
able code originally developed for the HP JIT (just in lime) 
product.' HP. JIT placed many of the common calls to Ihe 
underlying customizer technology lools into utility subrou- 
tines. Being able lo develop the HP 3000 portion ofthe 
Action Manager with reusable code helped us to be more 
productive and ensure better quality, consistency, and 
standardization in the code. 



HP MRP Action Manager Project Management 



When the investigation for the HP MRP Action Manager project started, we dis- 
covered that in addition to the challenges presented by the technical issues, we 
also had to define clear ly the goals of the product to complete the pioject on time. 
As is typical of many projects these days, these requirements meant that we had 
to make some changes to our traditional development process 

Purpose and Direction 

To ensure lhat we had a stated goal for the project and at the same time ad- 
dressed the technical issues facing us. during the investigation phase the develop- 
ment team focused on technical issues and the project managei worked with the 
team 10 develop a purpose and direction statement This statement documented 
the key aspects that were critical for our project's success. The following are the 
key areas identified and some of the subsequent actions we took to address them 

• The product must meet the customers' needs as simply as possible. We wanted to 
ensure that the functionality of the product would encourage Ihe user to prefer to 
use HP MRP Action Manage' instead of doing the work manually. We believed 
that if some aspect of the users job was more difficult or cumbersome with Action 
Manager, the usei would abandon using the product and revert to doing the job 
manually. 

• Ihe features in Action Manager must satisfy the top requests from the installed 
base of HP MM customers and HP field representatives We worked to ensure 
that the features we implemented in Action Manager truly satisfied customers' 
requests We also focused more attention on these features, since we knew that 
our customers had some particular interest in them. 

• There must be minimal impact to the existing HP MM product In past projects, we 
found that new enhancements (software changes) to HP MM often introduced 
defects to the existing software We agreed to focus on minimising changes to 
HP MM to eliminate the support headaches that often followed a software 
enhancement release 

• The product should support HP's cooperative computing siiategy As corporate 
citizens, we tell it was important to be consistent with the direction HP was head- 
ing in softwaie We formulated the product idea to take advantage of client/server 
technology and to use NewWave as a basis 



The process of developing the statement was more critical than the statement 
itself. It was in our discussions about our purpose and direction that we came to a 
common understanding of our goals 

One simple action was neglected, however We failed to share our purpose and 
direction goals with other project teams in the nrganization Their understanding 
of our project's goals was very different This resulted in some issues later, espe- 
cially during the integration phase when our project arid other projects became 
interdependent (because we were all completing our software for one combined 
release). 

Customer Focus and Understanding 

Some of the team members were experts on the targeted end users of Action 
Manager. Others had little or no knowledge of the customers' functionality needs 
(but had technology expertise instead). Throughout the project, we took advantage 
of every possible opportunity to show the latest prototype or actual product soft- 
ware to a customer, sales representative, or field engineer. This constant exposure 
to customers served two very important purposes First, it continued to educate 
every team member so that we could make better product decisions Second, it gave 
us a sense of whether we were on the right track with our product. Initially, we 
had to make some maior adjustments in oui ideas for implementing the product, 
but these changes became less and less significant as tbe project progressed 

Managing Dependencies 

This project required high-quality software components from other organizations 
within HP We were dependent on HP NewWave, HP Cooperative Services, NetlPC, 
NS services, and the Microsoft Windows Toolkit When we started the project. 
NewWave 2. 1 was the latest version available. We had to convert to NewWave 
3.0 in the middle of our project since n was key for us to release with that version 
We also had to transition from NetlPC networking software tc NS services. Thus, 
we were very dependent on other organizations to meet our schedules. We were 
able to tap into the expertise of engineers in these other organizations and resolve 
issues associated with these dependencies when they arose. 
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Development Process 

Since Microsoft Windows and the NewWave environment 
were new technologies for us. we did noi use every step of 
the traditional software development process to meet the 
objective of producing a high-quality product in a short lime. 

Development Partners. Throughout the development of the 
product, we used development partners ( both internal and 
external (0 IIP) to help in the design of the product. For 
example. HP MM internal and external cusiomers provided 
feedback on functionaliiy. usability, and reliability, and 
current product engineers and support engineers pro\ tded 
feedback on maintainability and supportability. 

Prototyping. Each prototype* completed was presented to OUT 
development partners for review. Because of the tools men- 
tioned above, we were able to make changes le> the soft ware- 
while the development partners we're sitting right there- with 
us. The formal presentations of the product were supple- 
mented with many informal presentations between each 
development cycle-. The- most important aspect of this proto- 
typing phase was that we were never afraid to throw away 
code to incorporate the requested functionality. 

Key Features. Many of the ke-y features incorporaled into 
Action Manager were base-el on user fe-e-dback and our ob- 
servations of our partners' working environments. Some of 
these features included: 
• Filtering action re-ports to allow planners lo see those parts 
thai are the highest -priority. We Observed our partners using 
a variety e>f methods to reduce the mountains of paper pro- 
duced lor a traditional MRP report. Most often they had to 
resort lo manual methods, thumbing through listings and 
paper-Clipping the most important pails of I he report. MRP 
Action Manager provides planners, through their chosen 
Biters, with the ability to see only those parts of the- MRP 
report that interest them 



• Adding usability improvements in choosing filtering criteria 
Once we gave our partners the ability to choose their own 
filters, we found thai the dialog boxes used for selecting 
these filters were cumbersome anel confusing We originally 
presented the dialog boxes in Figs. 1 and 2 as one dialog 
box. This presentation created a lot of confusion for users. 
Subsequent revisions attempted to eliminate choices that 
the planner rarely used. Tills process proved fruitless, sine-e 
almost every e-ombinalion was of some importance. After 
many iterations and usability evaluations, we cheise- the final 
set of dialog boxes in which the grouping titled Only Remaining 
Work represented the planner's most likely choice, but the 
grouping titled Specific Actions.. . allowed the planner to cus- 
tomize this grouping. By changing to two dialog boxes, we 
eliminated (he clutter of the one combined dialog box. while 
allowing everyone to select the- final combination. 

• Highlighting each planner's highest-priority actions. We ob- 
served that every' printed action report was highlighted with 
those actions most important tO that planner. In the prod- 
uct, we filter the reports based on the planner's selections 
and highlight only those actions on the report thai they have- 
chosen (see Fig. 7). 

• Providing comments lo annotate decisions. Every planner 
we visited had an action report with notes scribbled in the 
margins. To provide this ability in the Action Manager, we 
added the capability for users to adel annotations with each 
key decision they make on the action repent. 

• Providing the ability to archive an action report. Planners 
often comment ei I on how they kept past ae-tion reports 
nearby for easy reference in case they had a question about 
a past decision they hael made. We- provided in Action Man- 
ager the ability to archive a workspace with the annotations 
so that planners could e-asily reference those Workspaces 
and annotations when needed. 
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Testing. Throughout the development of the product, we 
used formal design reviews, code inspections, and testing to 
Contribute tO the Overall Correctness of the code in each 
module. In the area of testing the product, we used a com- 
bination of formal, ad hoc. and usability testing. 

Developed by an engineer familiar with the design of the 
product, the formal testing process included lest scenarios 
with documented expected results. Engineers tested those 
modules that they had not developed. This strategy revealed 
delects that the author of a module, because of familiarity 
with the code, would often miss. 

Ad hoc and usability tests were informal tests done by an 
engineer not familiar with the design of the product but very 
familiar with MRP and the planner's job. In many cases, ad 
hoc testing proved more effective than formal testing in 
Ending defects in the product. With little knowledge of the 
Internals of the product, the ad hoc tester could concentrate 
on testing what a planner expected from the product with- 
out being biased because of familiarity with the product 
design. 
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ol the University of California at Beikeley He is mar- 
ned and enjoys reading, hiking, and racquetball 

Gregory O. Burroughs 



Greg Burroughs contributed 
to the design verification of 
the HP 9000 Series 700 com- 
puters A native of San Fran- 
cisco. California, he received 
his BS and MA degrees m 
mathematics in 1978 and 
_ 1979 from the University uf 
California a" Riverside and 
Ins MS degree in computer science from the Univer- 
sity of Wisconsin at Madison in 1981 With HP since 
1981, he has worked on computational geometry, 
VLSI design tools, microprocessor and floating-point 
design, validation, and testing, performance analysis 
uf CPUs, floating-point coprocessors, and I/O sys 
terns, and workstation design verification. He is the 
author or coauthor of eight technical papers on CAD 
software, hardware performance, hardware testing 
and validation, and software engineering Greg is a 
member of the Mathematics Association of America. 
His leisure activities include ballroom dancing, 
volleyball, and singing in a church choir 

Audrey B. Gore 

As a member of the system 
verification team for the HP 
9000 Series 700 computers, 
Audrey Gore developed 
tests, analyzed simulation 
results, and maintained the 
model database She re- 
ceived hei BSEE degree from 
~* Brown University in 1988 
and joined HP's Systems Technology Division as a 
hardware engineer, developing a bus analyzer card 
for the HP 9000 Model 815 computer She is currently 
doing system verification at the Entry Systems Divi- 
sion Audrey is originally from Tappan, New York. She 
plays oboe with the Santa Clara University orchestra 
and English horn with the Peninsula Symphony She 
also plays acoustic guitar and enjoys writing and 
recording songs 




Steve W. LaMar 

Steve LaMar has done de- 
sign verification and testing 
for various floating-point, 
CPU. and system designs 
including the HP 9000 Series 
700 workstations and the HP 
9000 Model 815 CPU chip A 
native of Fresno, Calilorrua. 
he received his BS degree in 
computer science engineering Irom California State 
University at San Jose In 1985 He has been with HP 
since 1984 when he joined HP Laboratories, and is 
now with the Entry Systems Division. He is married, 
has a daughter, and enjoys skiing, tennis, and 
weightlitting 

Chi-Yen R. Lin 





Robert Lin is a software 
development engineer with 
HP's Entry Systems Division 
He was a member of the 
design verification team for 
I he HP 9000 Series 700 
workstations and for the HP 
9000 Model 81 5 computer 
Before joining the HP Sys- 
tems Technology Division in 1987. he developed 
graphics system software for Daisy Corp A native of 
Taiwan, he received his BS degree in fishery from the 
College of Marine Science and Technology In 1983 
he received MS degrees m oceanic science and com- 
puter-aided design from the University ol Michigan 
He is married and has one child 



Alan L. Wiemann 

' Entry Systems Divismn proj- 
' ect manager Alan Wiemann 
j ^^^^ has been working with PA- 
ij» ' RISC systems since he 
*T" P i oine< ! HP Laboratories in 
mH^^Eh|| '984 He managed the sys- 
^^B^H tern verification team for the 

j 

^^^^ " stations A native of North 
Dakota, he received his BSEE degree in 1980 from 
the University of Connecticut and his MSEE degree in 
1981 from Stanford University He has coauthored 
technical papers on computer architecture and pro- 
cessor design Before coming to HP he designed digi- 
tal telephone systems and components at Bell Labo- 
ratories. In addition to computer architectures and 
system verification, Alan's interests include scuba 
diving, hiking, bicycling, photography, gardening, and 
travel 

43 Model 720/730 Mechanical 
Design 



John P Hoppal 




John Hoppal is a technical 
contributor with HP's Entry 
Systems Division He re- 
ceived his BS degree in me- 
chanical engineering from 
Colorado State University 
and joined HP u> 1979 He 
has done mechanical design 
for several generations of 
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HP 9000 compute's including the Series 500, the 
Models 825 and 835. and the Models 720 and 730 
Two pending patents on the disk tray and EISA iray 
concepts for the Models 720 ana 730 have resulted 
from his wort Born m Sterling, Colo/ado. John s 
marr-ed and has DM sons He enjoys bicycling, 
hiking, traveling, and hi; family 

Arlen L Roesner 

7 product designer Arlen 

I i~ •> / Roesner leceived his BSME 
{ * degree from Montana State 

■^tV * ' '.' ft" 1 Collins Systems Division 
v.i- ib man 

factoring engineering for the HP 9836 and HP 9000 
Models 825, 835, and 845 computers and mechanical 
design for the HP 9000 Models 720 and 730 work- 
stations He is married, has one child, and enioys 
fishing and woodworking 

49 Series 700 Manufacturing 

Kevin W. Allen 

Kevin Allen received his 
^i^k BSME and MSME degrees 

in 1 97H and 1 979 rrnm the 
™ . " University of California at 
Davis After joining HP's 
Desktop Computer Division 
UI1979. he worked in R&D 
for four years and as a pro- 
cess engineer for five years, 
then became process engineering manager In that 
capacity, he was responsible lot implementation of 
the surface mount process tor HP 9000 Series 700 
computer manufacturing at HP's Colorado Computer 
Manufacturing Opeiation A member of the Interna- 
tional Society for Hybnd Microelectronics, he has 
coauthored several articles on surface mount pick- 
and plane equipment and surface muunt process im- 
plementation He is married! has iwo daughters, and 
plays sodbail and basketball 

Samuel K. Hammel 

Kelley Hammel is a new- 
product engineer at HP's 
Colorado Computer Manu- 
facturing Operation For the 
HP 9000 Series 700 work- 
stations, he was responsible 
for introduction to manufao 
luimg, design for test, test 
strategy and implementa- 
tion, and test controller and (lata collection system 
development Before coming to HP, he was a design 
engineer for Mohican Enterprises A native of Ports- 
mouth, Vngima. he received his BSEE degree in 1988 
from Ohio Slate University and Mis MSEE degree in 
1991 from Stanlord University He is married, has a 
daughter, en|Oys volleyball anr) model railroading, 
and seives as youth director at his church 




Anna Marie Hargis 

J | M I Process Onve>opmer: engi 

Colorado Computer Manu- 
facturing Operation m 1988 
A native of Cincinnati, Ohio 
ana a 1987 Ohio State Uni- 
versity graduate with a BS 
degree in industrial and sys- 
tems engineering, she itad 
previously done manufacturing consulting with Arthur 
AnOerson and Company At HP, she has done process 
engineering for through-hole printed circuit board 
assembly, and equipment layout and process design 
for the surface mount production line for HP 9000 
Series 700 workstations She is a coauthor ot an 
article on surface mount assembly 

Paul Roeber 




As manufacturing engineer- 
ing manager at HP's Exeter 
Computer Manufacturing 
Operation, Paul Roeber's 
contribution to the HP 9000 
Series 700 workstations was 
the establishment of a new 
process tor the release of 
software products Paul 
came to Apollo Computer Systems In 1982 from Data 
General Corp . where he was software manufacturing 
engineering manager A native of Boston Massachu 
setts, he holds a BS degree in industrial engineering 
and an MBA degree, both from Northeastern Univer- 
sity He is married, has twn children, and enioys vol- 
leyball, travel, boating, and skiing He also serves on 
the board ol incorporators of the Franklin Children 
School and as head trustee lor the Bay Street Parking 
Trust 

Spencer M Ure 

Sfcrf Spence Ure is manufacturing 
I development engineering 
I manager at HP's Coloiado 
I Computer Manufacturing 
I Operation As new product 
H engineering manager tor the 
^ HP 9000 Series 700 work- 
stations, he was responsible 
for the worldwide manufac- 
turing introduction of these products Previously he 
had served in a similar capacity for lhe HP 9000 Mod 
els 825 and 835 and HP 3000 Series 925, 935, and 
945 computers With HP since 1955. when h8 |Oined 
lhe Colorado Springs Division, he also served as pro- 
duction engineering manager lor the HP 9000 Series 
500 and was in R&D for nine years Bom in Duluth, 
Minnesota he received his BS degree in mechanical 
engineering from the University nf Utah in 1965 
Stience is married and has four children and one 
grandchild He is involved m cub scouting, boy scout- 
ing, and church activities, and his interests include 
Dackpacking. photography birding. military history, 
anil railroading. 






55 Series 700 Entry Level 



John A. Dykstal 

v 

■son in 1988 ano his MSEE 
deg-ee from Stanford Uni- 
versity m 1989 Hesanaro- 
ware design engineer at 
HP's User Interlace Technol- 
ogy Division, which he 
pined in 1989 He has contributed to the hardware 
design of the Tu'boVRX graphics subsystem and the 
graphics subsystem for the HP 9000 Models 705 and 
710 workstations John is a native of Madison. 
Wisconsin His interests include skiing, Softball, 
volleyball, ana basketball 

Craig R. Frink 

Craig Frink is an engineer/ 
scientist at lie Advanced 
Systems Division of the HP 
Workstation Business Unit 
He has served as system 
architect and lead designer 
for the Apollo DN3000, 
DN4000, DN3500, and 
DN4500. the HP Apollo 
ON5500. and the HP 9000 Models 705 and 710 com- 
puters He has published papers on virtual cache ar- 
chitecture and design, system-level simulation, and 
workstation design He |oined Apollo Computer Sys- 
tems in 1984 and holds a BSM degree in manage- 
ment from Lesley college His interests include 
hockey, running, and archery. 

Robert J Hammond 

User Interlace Technology 
Division protect manager 
Bob Hammond was proiect 
manager for the multimedia 
audio system for the HP 
9000 Models 705 and 710 
computers He joined Apollo 
Computer Systems in 1987 
and has been a softwaie 
engineer and project manager for various Apollo 
graphics subsystems Before joining Apollo, he was a 
softwarn engineer with Compugraphic Corp Bob was 
hnrn in Allentnwn, Pennsylvania He received his BS 
degree in biochemistry from the University of Con- 
necticut in 1980 and studied electrical engineering 
and computet science at the University of New 
Hampshire He is married, has three children, and 
enioys hockey, basketball, scuba diving, and fishing 

Don C Soltis. Jr. 

I Don Soltis |0ineri HP's Fort 
^fflR^ - Collins Systems Division in 

W ' ■ ' 7 1 t as a manufac 

\ V|M luring deveiii, 

ueei, and then became an 
_^J^H R&D liardwan 

the 1/0 ASIC lor lhe HP 9000 
Models 425e and 475 computers and lhe voice- 
quality audio system tor the HP 9000 Models 705 
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and 710 computers He's now with the HP Measure- 
ment and Control Systems Division Don received Ins 
BSEE and MSEE degrees from Colorado State Univer- 
sity in 1984 and 1988 He is married and has three 
children His interests include fly-fishing, bicycling, 
skiing, and camping 

64 Color Print Cartridge 

Daniel A. Kearl 

Dan Kearl is a senior archi- 
tect at HP's Inkjet Compo- 
nents Division with interests 
in thin-film and microelec- 
tronic process development. 
He contributed to the Desk- 
Jet/OeskWriter color print 
cartridge architecture and 
testing A native of Bellevue. 
Washington, he graduated from Washington State 
University m 1980 with a BS degree in physical met- 
allurgical engineering and |oined HP's Loveland 
Technology Center the same year. He has done pro- 
cess development for IC assembly and packaging, for 
thin-film deposition for both precision resistor net- 
works and inkjat print cartridges, and for thermal 
irikjet barrier film imaging He serves on advisory 
boards for local school and college technology pro- 
grams, and lias a special interest in "antique technol- 
ogy" — steam engines, gas engines, tractors, and 
hydropnwer— collecting, restoring, and running the 
"old iron", and collecting books of the era His other 
interests include greenhouse gardening, sailing, old 
motor boats, and beer and wine making. 

Michael S. Ard 

Mike Ard is an R&D project 
manager al HP's Vancouver 
Division He was project 
manager foi development of 
_ I the firmware and electronics 

p — ' yf^L far the DeskJet 500C and 
DeskWnter C printers He 
received his BS and MS 
degrees In computer science 
from Brigham Young University in 1975 and 1978 and 
joined HP's Data Systems Division in 1976. moving to 
the Vancouver Division three years later Mike was 
born in St. Anthony. Idaho He is married and has 
seven children He coaches youth basketball and is 
involved in church activities 

69 Ink & Cartridge Development 

Craig Maze 

Craig Maze helped develop 
the inks for The DeskJet/ 
DeskWnter color prim car- 
tridge With HP since 1978. 
he has also worked on liquid 
crystal displays for calcula- 
tors, IC process develop- 
ment, and print cartridge 
architecture He received his 
BS degree in chemical engineering from Purdue Uni- 
versity in 1959 and his PhD m chemical engineering 
from Iowa State University in 1970 Before coming to 
HP. he worked on IC packaging ano liquid crystal 





displays for Motorola and on aerospace materials 
engineering for Martin-Manetta A member of the 
American Chemical Society and the American Institute 
of Chemical Engineers, he is named as an inventor on 
two patents related tD liquid crystal displays 

Loren E. Johnson 

An ink chemist at HP's Inkjet Components Division. 
Loren Johnson helped develop the inks for the Desk- 
Jet/DeskWnter color print cartridge. He has also 
worked on inks for previous DeskJet and ThinkJet 
products, on etch process development for the Think- 
Jet prmthead, on ThinkJet ink manufacturing, and on 
evaluation of thermal printer papers He joined HP at 
the Vancouver Division In 1981 A native of Rapid 
City, South Dakoia, he received Ins BS and MS de- 
grees in chemistry from the South Dakota School of 
Mutes and Technology in 1978 and 1 980 Six patents 
on mkjet ink compositions have resulted from his 
work Loren is married and has a daughter His inter- 
ests include hiking, fishing, computer gaming, xero- 
botany, drawing, science fiction, pottery, and cross- 
country skiing 

Daniel A. Kearl 

Author's biography appears elsewhere in this sectmn. 
James P. Shields 

^ ^£9^9 Jay Shields is an R&D chem- 

^ ist at HP's Inkjet Compo- 
nents Division. With HP 
since 1988. he was on the 
ink Development team for 
the DeskJet 500C/Desk- 
Writer C color print car- 
tridge He received Ins BS 
degree in chemistry from 
Bradley University in 1982 and his PhD degree in ana- 
lytical chemistry from Oregon State University in 1987 
Before coming to HP. he was a research chemist at 
Dow Chemical He has published several articles m 
the area of analytical plasma spectroscopy Jay is 
married and enioys woodworking, wine making, and 
a variety of outdoor sports 

77 Automated Assembly 

Lee S. Mason 

Lee Mason is a mechanical 
engineer at HP's Inkjet Com- 
ponents Division, specializ- 
ing in design for manufactur- 
ability, machine design, and 
automated assembly Born in 
Pittsburgh. Pennsylvania, he 
received his BS degree in 
mechanical engineering 
from the University of California at Davis In 198' and 
joined HP's Corvalhs Division the same year, serving 
as an R&D and production engineer for the HP 75C 
handheld computer From 1984 to 1987. he designed 
turnkey robotic assembly systems at Inteiledex Inc. 
After returning to HP, he did tool design and process 
development for the DeskJet/DeskWnter black and 
color print cartridges In 1992 he received an MBA 
degree from the University of Oregon. Lee is married 
and has two daughters Now that his MBA studies 
are completed, he hopes to have more time for gar- 
dening, bicycling, water skiing, and family activities. 







Manufacturing development 
engineer Mark Huth came to 
HP's Corvallis Division in 
1981. shortly after receiving 
his BSME degree from the 
Virginia Polytechnic Insti- 
tute He has developed auto- 
mated print cartridge manu- 
facturing processes for the 
PaintJet printers, and did pro- 
cess development and tool design for the DeskJet/ 
DeskWnter color print cartridge His professional 
interests include automated manufacturing and ma- 
chine vision Mark is married and has two sons. He is 
a rock climber and shares interests in soccer and 
Sumo wrestling with his sons 

84 Adhesive Technology 

Douglas J. Reed 

Doug Reed is a manufactur- 
ing development engineer at 
HP's Inkjet Components Divi- 
sion For the DeskJet/Desk- 
Writer color print cartridge, 
he worked in R&D on seal- 
ing surface development and 
in manufacturing on the 
adhesive dispense process 
and tooling. He also helped develop the adhesive 
dispense process and did tooling modification for the 
DeskJet black print cartridge Doug received his 
BSME degree from Oregon State Univetsity in 1987 
and joined HP the same year He is a member of the 
ASME and the Society ol Manufacturing Engineers 
Born m Elmira, New York, he is married, has three 
children, and is expecting a fourth He is involved in 
church leadership and enjoys fly-fishing, reading, and 
writing 

Terry M. Lambright 

Now with HP's Inkjet Com- 
ponents Division, mechani- 
cal development engineer 
Terry Lamonght joined the 
HP IC Business Division in 
1984 as an incoming materi- 
als engineer, later working 
on the encapsulation of TAB 
circuits for HP 20 Series cal- 
culators. For the DeskJet/DeskWnter color print car- 
tridge, his responsibilities included liaison with the 
printer development team, adhesive selection, and 
TAB circuit design. Before coming to HP he worked 
for hve years in the nuclear industry on materials 
decontamination and consolidation, and three years 
in the aerospace industry on the manufacture of tita- 
nium tubing He's the author of a paper on titanium 
alloy tube fabrication Born in Gabon. Ohio, he at- 
tended the University of Arizona, receiving a BS de- 
gree in metallurgical engineering m 1974 and an MS 
m materials science in 1976. He 15 married, has two 
children, is involved in church leadership, and holds 
the rank of captain in the U S Air Force Reserve His 
interests include marquetry, woodcarving, reading, 
and working with international students. 
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87 Machine Vision 



Michael J Monroe 

I tunng developmeni engineer 
at HPs lnk|et Components 
Dw3ion A specialist m opti- 
I C3' oesign. mactitne vision. 
| and control systems, he 
lOined the HP Corvallis Divi 
Sion in 1979 He has served 
as a production and service 
engineer for desktop computers and as an electronic 
tooling engineer tor portable computers and hand- 
held calculators, and developed the machine vision 
systems for DeskJet/DeskWnter color print cartridge 
production. Before joining HP he worked on the Viking 
Mars probe at the Jet Propulsion laboratory (NASAI 
and on electronic systems for missile launching sub- 
marines He was also associated with a computer 
design consulting firm He received his BSE degree in 
1971 from the University of California at Los Angeles, 
specialising in electronics engineering A member of 
the IEEE, he was born m Salt Lake City, Utah and 
served in the Air National Guard for sin years attain- 
ing the tank of sergeant He is married, has two 
daughters, and enjoys making telescopes, metal- 
working, and woodworking 

93 DeskWriter C Driver 



William J. Allen 

Will Allen was one of the 
developers of the software 
drivers for the DeskJet 500C 
and DeskWriter C color ink- 
jet printers. Born in Lafayette. 
Indiana, he received Ins BS 
degree in computer science 
from Purdue University in 
1982 After joining HP in 
Colorado in 1983, he developed himware for the HP 
16500 and 1630G logic analysers, then left HP briefly 
m 1987 to )Oin a startup company in Oregon, where 
he developed automated test equipment software 
Rejoining HP a few months later at the Vancouver 
Division, he served as a product support engineer 
and in 1989 moved to the R&D lab, where he lias 
worked on primer motion control himware and soft- 
ware dnveis Will is married, has three children, and 
coaches a youth soccer team His other interests in- 
clude travel, bicycling, swimming, table tennis and 
track woik at auto races 






Toni D Courville 

Software development engi- 
neer Toni Courville received 
her BS degree in computer 
science from Portland State 
University m 1988 With HP's 
Vancouver Division since 
'988. she ha* worked on 
firmware for the DeskJet 
500 and Oesk Winer printers, 
and helped develop the driver for tne DeskWriter C 
She is a native of Boise. Idaho and enjoys bicycling, 
skiing, and other outdoor activities 

Steven 0. Miller 

Steven Miller has done soft- 
ware and firmware engi- 
neering for HP's Vancouver 
Division since 1985. contrib- 
uting to the firmware and 
driver design of the Rugged- 
Writer. DeskWriter. and 
DeskWnter C printers A 
native of Believue. Washing- 
ton, he received BS degrees in computer engineering 
and computer science from Oregon State University 
in 1985. A patent on a data compression scheme 
names him as an inventor Steveii is married, has two 
children, and enjoys boardsailing, golf, skiing, and 
white-water rafting 



103 MRP Action Manager 



Alvina Y. Nishimoto 

An engineer/scientist at HP's 
Software Technology Divi- 
sion. Alvina Ntshimoto is 
responsible lor the develop- 
meni. enhancement, and 
suppon of manufacturing 
software products She 
| joined HP in 1978 after re 
ceiving both her BS and MS 
degrees in industrial engineering from Stanford Uni- 
versity that same year Her professional interests 
include software reuse and software developmeni 
processes Alvina was born in Honolulu, Hawaii 



William J. Gray 





the HP MRP Action Manager 
project at HP's Manufactur- 
ing Productivity Division 
(MPD). Bill Gray is curr&TfJy 



computer science from North Carolina State Univer- 
sity that same year Bill was bom m Macon. Georgia, 
is married, and has one child His recreational activi- 
ties include golf, bowling, soccer, and racauetball 

Barbara J. Williams 

1^^^^ I Currently a software 
JPRPfe^ I manufacturing engineering 
^f^^B^ I manage' 
■T.- \M 1 15 Division. Barbara 

& Williams was the project 

I manager for the HP MRP 
Manage project at 
HP's Manufacturing Produc- 
^^^^ tivity Division. Barb jo 
HP's Computer Systems Division in 1982 She worked 
as a developmeni engineer on the HP PM/3000 and 
HP MM/3000 products and as a project manager for 
the A 02 00 release of the HP Purchasing enhance- 
ments project While attending college she worked 
as a software engineer in a summer intern program 
at Bell Laboratories. She has a BS degree 11982) in 
computer science from Washington State University 
and she is certified in production and inventory man- 
agement ICPIM) by the American Production and In- 
ventory Control Society Managing people, processes, 
anrl projects are her mam professional interests She 
served as a United Way loaned executive for HP In 
1991 Bom in Wenaichee. Washington, she is married 
and expects her first child in the summer Sewing, 
aerobics, swimming, tennis, camping, waterskung. 
gardening, and volleyball are among her recreational 
activities 
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